SCATTERING FOR THE CUBIC KLEIN GORDON EQUATION 
IN TWO SPACE DIMENSIONS 
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Abstract. We consider both the defocusing and focusing cubic nonlinear 
Klein— Gordon equations 



in two space dimensions for real- valued initial data u(0) £ H\ and ut(Q) S L^,. 
We show that in the defocusing case, solutions are global and have finite global 
L| spacetime bounds. In the focusing case, we characterize the dichotomy 
between this behaviour and blowup for initial data with energy less than that 
of the ground state. 

These results rely on analogous statements for the two-dimensional cubic 
nonlinear Schrodinger equation, which are known in the defocusing case and 
for spherically-symmetric initial data in the focusing case. Thus, our results 
are mostly unconditional. 

It was previously shown by Nakanishi that spacetime bounds for Klein- 
Gordon equations imply the same for nonlinear Schrodinger equations. 
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1. Introduction 



We consider the cubic nonlinear Klein-Gordon equation 



i + nu 6 = 0, 
±1 with [i = 



Uu — Am - 

for real- valued u : R t x R 2 — > R. Here \x 
equation and /i = -1 as the focusing case. 

We consider initial data in the energy space u(0) € H x and u t (0) € L 2 
precisely the set of initial data for which the energy 



(1.1) 

1 known as the defocusing 
This is 



E(u) := E(u(t),u t {t)) 



±\u t (t,x)\ 2 + ±\Vu(t,x)\ 2 + ±\u(t,x)\ 2 + !± |u(i,x)| 4 cfe 

(1-2) 

is finite. The energy is conserved. 

Definition 1.1 (Solution). We say that a functio n u :/xl 2 ->Kona non-empty 
time interval € I C R is a (strong) solution to (1.1) if (u,u t ) € C®(K\H\ x L x ), 



u 6 Lf X (K x M 2 ) for all compact K d I, and u obeys the Duhamel formula 



u(t) 
«t(t) 



«(0) 
«t(0) 



(1.3) 



u 3 (s) ds 



cos((V)i) (V^sin^V)*) 
(V)sin((V)t) cos((V)i) 

^-^((VXt-a))- 
cos((V)(i - a)) 

for all t £ I. We refer to the interval / as the lifespan of u. We say that u is a 
maximal-lifespan solution if the solution cannot be extended to any strictly larger 
interval. We say that u is a global solution if I = R. 

Our main goal is to prove that global strong solutions exist and have finite 
spacetime norms. As the equation is energy-subcritical, global well-posedness fol- 
lows easily in the defocusing case. In the focusing case, this is known not to be 
true for arbitrary initial data. Indeed, explicit counterexamples are known. Let Q 
denote the unique positive radial H X (R 2 ) solution to 

AQ + Q 3 = Q, (1.4) 

which is known as the grou nd state and is an optimizer in the Gagliardo-Nirenberg 
inequality. ( See subsection 2A for more about this.) Then u(t, x) — Q(x) is a static 
solution to (IT). In particular, it does not have finite global spacetime bounds nor 
does it scatter to free waves at future/past infinity. Moreover, finite-time blowup 
occurs for a large class of initial data that are slightly larger, including the case 
u(0) = (1 + e)Q and u t (0) — for any e > 0. This is proved by the method of 
Payne and Sattinger Q ; see Theorem L6 below and Section || for more details. 
Nevertheless, it is believed that Q is the minimal counterexample to the existence 
of spacetime bounds, in a certain sense. 

In the focusing case, it is not appropriate to measure the 'size' of the initial 
data purely in terms of the energy because of the negative sign appearing in front 
of the potential energy term. Indeed, the energy of the solution with initial data 
u(0) = (1 + e)Q and u t (0) — for any e > is strictly less that that of the static 
solution Q. For this reason we introduce a second notion of size, namely, the mass: 



M(u(t)) :-- 

Unlike the energy, this is not conserved. 



K 2 



\u(t, x)\ 2 dx. 



THE CUBIC KLEIN-GORDON EQUATION IN TWO SPACE DIMENSIONS 



3 



These considerations, together with our assertions regarding the defocusing case, 
can be summarized as follows: 

Conjecture 1.2 (NLKG Conjecture). Fix n — ±1. Let {uq,u\) € H x x L x and in 

the focusing case assume also that A1(uq) < M(Q) and E(uq,ui) < E(Q). Then 
there exists a global solution u to ( p_.l| ) with initial data u(0) = uq and ut(0) = u\. 
Moreover, this solution obeys global spacetime bounds 

\W\\l?hi + \\iit\\ L fLl + W u \\lI„ ^ C(£(uo,ui)). (1.5) 

As a consequence, the solution scatters both forward and backwards in time, that 
is, there exist (u^iif) € H% x L x such that 

cos((V)t) (V} _1 sin((V)t)" 
(V)sin«V)t) cos«V)t) 



uit) 

yU t {t) 

in Hz x Lz, as t — » ±oo 



The main point here is the L\ x spacetime bound and concomitant proof of 
scattering. Global well-posedness was known previously. In the defocusing case, it 
is a simple consequence of energy conservation — the equation is energy-subcritical. 
In the focusing case, our hypotheses are sufficient to imply control of the L^°(H X x 
L x ) norm and hence also global well-posedness; the key ingredient here is the sharp 



Gagliardo-Nirenberg inequality. See the discussion in subsection 2.4 



Nakanishi 1 33 has shown that spacetime bounds for the complex- valued Klein- 
Gordon equation imply spacetime bounds for the corresponding nonlinear Schro- 
dinger equation (in the mass- and energy-critical settings and all dimensions). By 
employing the X s - b arguments we describe in Section^, one may adapt his proof to 
the case of the real- valued Klein-Gordon equation discussed in this paper. The key 
observation behind these results is that the Klein-Gordon equation degenerates 
to the Schrodinger equation in the non-relativistic limit. Thus any solution to 
NLS can be used to produce a solution to Klein-Gordon by suitable rescaling and 
other minor modifications. We caution the reader that this produces a very narrow 
subclass of solutions to Klein-Gordon and that the long-time behaviour of the two 
types of solution is not identical. 



To summarize, a resolution to Conjecture 1.2 implies a positive answer to the 
analogue for NLS, specifically, 

Conjecture 1.3 (NLS Conjecture). Fix /j, = ±1. Let wq E L x (M. 2 ) and in the 
focusing case assume also that M(wq) < M(Q). Then there exists a unique global 
solution w to 

iwt + Aw = /i\w\ 2 w (1-6) 
with w(0) = Wq. Furthermore, this solution satisfies 

\\ w \\Li (RxR 2 ) 

< C(M(w )) 

for some continuous function C. As a consequence, the solution w scatters both 
forward and backwards in time, that is, there exist w± € L? x such that 

||iu(t) - e ItA w±|| L 2 -> as t — !> ±oo. 

This conjecture has been proved in the defocusing case |l| and for spherically 
symmetric data in the focusing case (22). (See also O] and J25] for these results 
in higher dimensions in the defocusing and focusing cases, respectively.) The ideas 
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developed in this paper are not sufficient to resolve the remaining case of either 



conjecture. Nevertheless, we will prove that resolution of Conjecture f.3 is the 
only obstruction to spacetime bounds for NLKG. As NLS only represents the non- 
relativistic limit, this means providing spacetime bounds for all solutions except 
those living at astronomic length-scales. We contend that it is more natural to 
treat these excluded solutions in the NLS setting because scale invariance is restored 
there. 

Here at last is the precise statement of the main theorem in this paper. 



Theorem 1.4 (Spacetime bounds). Fix fi = ±1 and assume that Conjecture 



holds for this choice. Then Conjecture l.i holds 



As mentioned above, Conjecture 1.3 has been resolved except for the focusing 
case with non-radial data. As a consequence, our results are unconditional in all 
but this case: 



Corollary 1.5. Conjecture \l.3l holds in the defocusing case and for spherically 
symmetric data in the focusing case. 



We have chosen to present our principal result as Theorem 1.4 since it more 
honestly represents what is achieved in this paper. It also reiterates our belief that 
the natural way to attack dispersive PDE with broken symmetries is first to treat 
independently the limit cases where the symmetries are restored. 

In Section ^, we show that the hypotheses in the focusing case are sharp in the 
following sense: 



Theorem 1.6 (Blowup). Let u be a maximal-lifespan solution to (1.1) in the fo- 
cusing case with initial data obeying 

E(u) < E(Q) and M(u(0)) > M(Q). 

Then the solution u blows up in finite time in at least one time direction. 

Both our well-posedness and blowup results for the focusing Klein-Gordon equa- 
tion constrain the behaviour of the mass and the energy, while in the NLS setting, 
the natural conjecture considers only the mass. In connection with this, we note 
that there are Schwartz-space solutions to focusing NLKG with mass that is ar- 
bitrarily small uniformly in time but do not scatter (or admit global spacetime 
bounds). For instance, given v 6 R 2 , 

u v (t,x) := Q(x ± + (is)x^ — vi), where x — x" + x ± and x^~ _L u, 

is a solution to the focusing case of ( |l.l| ) with mass M{u v (t)) = (v)~ 1 M (Q), which 
can be made arbitrarily small by sending v — > oo . This example arises from the fact 
that Lorentz boosts do not preserve the mass. By comparison, the corresponding 
Galilei symmetry of NLS does conserve the mass and so this phenomenon does not 
occur. 



The analogue of Conjecture 1.2 for focusing nonlinearities of the form \u\ p u 
with p > 2 was resolved recently in Jl6| , the defocusing case having been treated 
previously in To properly explain the relation between these works, our efforts 
here, and other work in higher dimensions, we need to introduce the notion of 
criticality; the fact that our equation does not possess a scaling symmetry makes 
this a little more awkward than in related problems such as the Schrodinger and 
wave equations. 
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The dispersion relation — ui = (£) := (1 + l^l 2 ) 1 / 2 for the free Klein-Gordon 
equation has two natural scaling limits: the wave equation at high frequencies 
and the Schrodinger equation at large length scales. We deliberately did not refer 
to low frequencies here because the action of Lorentz boosts means that a solution 
characterized by a large length scale may actually be centered around any frequency 
it wishes. (As we will see, Lorentz boosts will be a source of particular vexation in 



the treatment of (1.1); this is a phenomenon that is peculiar to our nonlinearity as 
compared to \u\ p u with p > 2.) 

Both the Schrodinger and wave scaling limits restore scale invariance and hence 
define a notion of criticality, indeed, a common notion of criticality: 

The nonlinearity ± \u\ p u is ij| c (K d ) critical precisely when s c = | — |. 



On the other hand, the energy (L2) controls both L x and H x , at least in the 
defocusing case, and hence all intermediate Sobolev spaces. In two space dimen- 
sions, < s c < 1 corresponds to nonlinearities with p > 2. Note that the case 
p = 2 is L^-critical, while there is no ij^-critical nonlinearity of power-type in two 
space dimensions. In d > 2 dimensions, is L^-critical and known as the mass- 

critical case, while l^l 3 ^ u j s iji-critical and is known as the energy-critical case. In 
two dimensions the name energy-critical is used for exponential-type nonlinearities 
inspired by inequalities of Trudinger-Moser type; see [||[ |l6| . 

For simplicity, let us begin with some historical remarks in the defocusing case 
in dimensions d > 3. For both the Schrodinger and Klein-Gordon equations in 
the inter-critical regime % < p < an d data in the energy space, scattering 
was proved some time ago; see |Q and references therein, as well as |32], [l3| for 
simplified treatments. The case of energy-critical nonlinearities was first treated 
in the NLS setting in breakthrough work of Bourgain |4||. This paper was for 
radial initial data, the non-radial case being treated in [p[ |35|, [l6f and (non-radial) 
Klein-Gordon in |50|. The L^-critical case was resolved recently in see also 
|25| HJ], for earlier work in the radial case. For the latest on the focusing case 
in dimensions d > 3, see §, || || ||. 

For the Klein-Gordon equation in the case d = 2 discussed in this paper, scat- 
tering has been proved for data in the energy space in the inter-critical cases [^lj 
(defocusing) and [jl6| (focusing), as well as in the energy-critical case |ll| (defocus- 
ing) and (lfj (focusing). The remaining L^-critical case is the topic of this paper. 

As noted above, scattering in the inter-critical regime (for both the Klein-Gordon 
and the Schrodinger equations) was proved significantly before any progress was 
made in the critical cases. This is with good reason: the fact that conservation 
laws control both higher and lower regularity norms gives excellent control over all 
scale-invariant quantities. In the critical cases, one of the two controls becomes 
redundant / unusable — any attempt to incorporate it results in non-scale invariant 
(and hence patently ridiculous) statements. Of the two critical cases, we contend 
that the l? x -critical case is more difficult. Our reasons for making this claim, which 
go beyond the mere fact that one has been resolved and the other not, will become 
apparent when we present the outline of the proof. 

1.1. Outline of the proof. The key to proving scattering is to show finitcness of a 



global spacetime norm. The most natural choice in the case of (1.1) is L\ X (R x R ). 
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This is also the natural choice for the Z^-critical NLS and, in particular, is scale- 
invariant in that setting. For a general time interval I CM. we will use the notation 

Si(u) = \\uf LUlxM2) , (1.7) 

which we refer to as the scattering size of u on the interval /. We will also write 
S>t{u) to denote the scattering size on the the interval [t, oo ) an d similarly, S<t(u) 
for the interval (— oo,t]. The Strichartz inequality (Lemma |2.3| ) shows that Sk(u) 
is finite for solutions u of the linear Klein-Gordon equation. 

To prove the existence of a global solution u with <Sr (it) finite, we will employ an 
induction on energy / contradiction argument in the style of Kenig and Merle |fl8f . 
(See also j| |2p| f or some key steps in the development o f th is methodology, as well 



as the review [§§.) The idea is as follows: If Conjecture [L2 were false, there would 
be a sequence of solutions u n : I n x M 2 —> K for which 

E(u n ) — !> E c < oo but Si n (u n ) ->• oo. (1.8) 

In the focusing case we would also have 

limsupM(w n (0)) < M(Q) and E c < E(Q). 

n— >oo 

Without loss of generality we may choose E c to be the smallest number for which 
such a sequence exists, which is then called the critical energy; it is positive by virtue 
of the small-data theory expounded in Section ||. A key observation of Keraani 
(originally made in the mass-critical NLS setting) was that from this sequence one 
can extract a minimal counterexample to the conjecture. 

The proof of the existence of a minimal counterexample is not trivial. Non- 
compact symmetries provide an obvious means for a minimizing sequence to fail 
to have a convergent subsequence. For the Klein-Gordon equation these include 
space and time translations, as well as Lorentz boosts. While not a true symmetry, 
dilations (specifically, to large length scales in our case) also provide a manner 
in which the minimizing sequence may fail to converge. The usual method for 
studying variational problems with symmetry is the concentration compactness 
technique and this is what we will employ here. Recall from [^8) that concentration 
compactness presents us with three scenarios: compactness (the desired outcome) 
vanishing (the complete lack of concentration) or dichotomy (the splitting into two 
or more wave packets). 

T he k ey to disproving vanishing is an inverse Strichartz inequality (see Theo- 



rem L9), which shows that the scattering size cannot be large without possessing 
a bubble of concentration. When the critical regularity is positive, this type of 
result can be deduced from an inverse Sobolev embedding inequality. It was in 
this manner that concentration compactness techniques were first introduced in 
the dispersive setting; see ||. In the L^-critical setting, there is no possibility to 
involve Sobolev embedding and all known inverse inequalities rely on deep results 
in harmonic analysis, specifically, on progress toward the Restriction Conjecture of 
Stein, H^]. (Sections 4.2 and 4.4 of the lecture notes [^3) discuss inverse Sobolev 
and Strichartz inequalities, respectively, in a consistent manner and should aid the 
reader in making comparisons.) 

We will use the sharp bilinear restriction theorem of Tao, jh)| . The result is global 
for the paraboloid (the Schrodinger case), but only applies to compact subsets of 
other elliptic surfaces (for example oj + (£) =0, which is the dispersion relation 
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in our case). This necessitates several preliminary reductions: first to a frequency 
annulus (cf. Lemma |4.l[) and then to a narrow (unit width) sector inside this 
annulus (cf. Corollary ^.6| ). A Lorentz boost is then used to bring this tube into a 
fixed neighbourhood of the origin where Tao's result may then be applied. 

The necessity of incorporating Lorentz boosts in the discussion of the inverse 
Strichartz inequality is (as mentioned before) a peculiarity of the L^-critical case. 
L^-supercriticality of an equation prevents the characteristic frequency scale of the 
functions in the minimizing sequence from shrinking to zero. This reasoning also 
explains why previous work on scattering in the L^-supercritical regime has not 
had to directly address (or appeal to) the connection to NLS, namely, because NLS 
is only revealed when the characteristic frequency scale of a solution shrinks to 
zero. On the other hand, by playing boundedness of the energy against the H x 
scaling of Lorentz boosts, one can deduce that the boost parameters of minimizing 
sequences must stay bounded (that is, do not approach the speed of light). As the 
speed associated to any boosting of the minimizing sequence is then comparable to 
the variation in speed already found in the solution (that is, the variation in 
as £ varies over the Fourier support), the very existence of a boost becomes moot. 

In the L^-critical case, there is no lower bound on the characteristic frequency 
scale of a solution and, in particular, it may be much less than the (still bounded) 
parameter of a boost. This is why we must address the action of Lorentz boosts in 
this paper. 

We turn now to a brief discussion of the second unfavourable concentration 
compactness scenario: dichotomy. This will be excluded on the basis of the fact 
that we are dealing with a minimizing sequence (cf. Case II in Section 0). The 
key tools here are the decoupling results discussed in Section [| and the stability 
theory recorded in Section ||. Together they show that despite the fact that our 
equation is nonlinear, multiple wave packets act independently; hence, if a solution 
consisting of multiple wave packets has infinite scattering size, then so does one of 
the constituent wave packets. Passing to the single wave packet would then give a 
smaller value for E c appearing in flL.SD . Contradiction! 

On the basis of all we have discussed so far, we find ourselves in the desired 
(concentration) compactness scenario. In the NLS (or semilinear wave) context 
this would mean that, after applying symmetries of the equation to our minimizing 
sequence, we may exhibit a convergent subsequence and hence a minimal-energy 
blowup solution. In our case, however, one wrinkle remains: the Klein-Gordon 
equation does not possess scaling symmetry. As a consequence, the limiting minimal 
blowup solution may be a solution to NLS instead of NLKG! Indeed, this occurs 
whenever the characteristic length scale of our minimizing sequence diverges to 
infinity. (In the energy-subcritical regime, boundedness of the energy prevents 
divergence to zero.) 

Previously, Nakanishi ]3j| used this approximate embedding of NLS inside (com- 
plex) NLKG to show that the spacetime norm of a solution to the former provides 
a lower bound on the function C appearing in (1.5). In this way, he proved that 
spac etim e bounds for NLKG imply spacetime bounds for NLS. To use Conjec- 
ture L3 to prove scattering for NLKG, we need to do what is essentially the exact 
opposite: transfer upper bounds from NLS to NLKG. This reversal introduces two 
new aspects. First, for lower bounds one merely needs to control the quality of 
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the approximation for the amount of time it takes for the NLS solution to accu- 
mulate the majority of its scattering size; for upper bounds, we need to control the 
NLKG solution globally in time. In the long-time regime, the quality of the NLS 
to NLKG approximation deteriorates to an unsatisfactory degree. We deal with 
this issue by noting that by the time the approximation breaks down, the NLS 
solution has started emulating a solution to the linear Schrodinger equation and 
that consequently, the future linear Klein-Gordon evolution is small. This is the 



topic of Proposition S.6; the same philosophy was employed in [ |21| , which considers 
the embedding of NLS inside gKdV. 

The second new aspect is that to prove lower bounds, one may embed the NLS 
solution in whichever manner is convenient. For upper bounds, we must contend 
with all embeddings, including the possible incorporation of a Lorentz boost. 

A third difference between this work and p3[ stems from our decision to consider 
the Klein-Gordon equation for real-valued functions. This distinction is essentially 
irrelevant for most of the arguments, with the notable exception of the NLS/NLKG 
correspondence. In order to get an appreciation for the difference, we invite the 
reader to consider the nonlinearities 

|i>| 2 i> versus [Rei>] 2 Rei> 

for a plane wave v = e^"*'^*. The fact that the NLS/NLKG embedding is still 
valid is a testament to the fact that cos 2 (£x— (£)t) behaves sufficiently like a constant 
function; however, this resemblance cannot be captured using L\L r x norms. Instead, 
we need to use AT s ' h -inspired methods which capture the fact that the error terms 
present in this approximation do not exhibit spacetime resonance with the linear 
propagator. See also H ^l], ^lj for similar arguments in the NLS/gKdV context. 

The details of how these hurdles are overcome may be found in Section Ul- 
timately, we show that if the characteristic length scale of a minimizing sequence 
were to diverge (to infinity) then the solutions would inherit spacetime bounds from 



NLS, which contradicts the required divergence of their scattering sizes (cf. (1.8)) 



In this way we deduce that the failure of Conjecture 1.2 implies the existence of a 
minimal-energy solution to NLKG of infinite scattering size. This solution is global 
(in time). Moreover, by utilizing the action of time translations (in the now stan- 
dard manner), we see that this minimal-energy solution is almost periodic (= has 
precompact orbit) modulo translations: 

Definition 1.7 (Almost p erio dicity modulo translations). Fix /x = ±1. We say 



that a global solution u to (1.1) is almost periodic modulo translations (in if* x L 2 ) 
if there exist functions x : K — > R 2 and C : R + — > R + such that for every (eR and 
r\ > 0, we have 

\u(t, x)\ 2 + \Vu(t, x)\ 2 + \u t (t, x)\ 2 dx<n (1.9) 

x-x{t)\>C{rj) 

I \(0u(t,m 2 + \Mt,0\ 2 d^<v- (i-io) 

J\t\>C(_ri) 



We refer to x(t) as the spatial center function and to C as the compactness modulus 
function. 
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Remark 1.8. By (1.9) and the Gagliardo-Nirenberg inequality (enlarging C(r]) if 
necessary), 



/ 

J\x 



\u(t,x)\ 2 + \Wu(t,x)\ 2 + \u t (t,x)\ 2 + \u(t,x)\ 4 dx < r,. 

l\x-x(t)\>C( V ) 

Similarly, by compactness one may ensure that for each t € K, 



\(0u(t,0\ 2 + \Mt,0\ 2 dZ<r>- (1-11) 

l\H\<yC{r,) 

To recap, the arguments discussed thus far in this overview culminate in the 
proof (in Section [?]) of the following: 

Theorem 1.9 (Reduction to almost periodic solutions). Fix fi = ±1 and suppose 



that Conjecture [L3 holds but Conjecture l.i. fails for this value of \i. Then there 



exists a global solution u to (1.1) with energy E{u) = E c [and mass M(u(0)) < 
M(Q) in the focusing case). Moreover, u is almost periodic modulo translations 
and blows up {that is, possesses infinite scattering size) both forward and backward 
in time. 

The proof of this theorem actually shows that all minimal-energy bl owu p solu- 
tions are almost periodic, but this fact is not needed to prove Theorem 



1.4 



We refer to solutions of the type described in Theorem L9 as being soliton- 
like. While they move around in space (potentially arbitrarily), their profile does 
not change very much; specific ally, it remains inside some compact set in H*. To 



complete the proof of Theorem [LJ, we merely need to prove that such soliton-likc 
solutions do not exist. The first step in doing this is to provide some control over 
the spatial location x(t) of the soliton. From Einstein's relation E 2 = P 2 c 2 + m?c A 
relating the energy E, momentum 

P:=- u t Vudx, (1.12) 



and the rest mass m (c = 1 denotes the speed of light), we see that a minimal- 



energy blowup solution must have zero momentum. In Lemma B.2 we use this to 
show that x(t) = o(t). This is then combined with a monotonicity formula of virial 
type to obtain a contradiction. 

1.2. Notations. Our convention for the Fourier transform is as follows: 

/(0 = EF / e-^f(x)dx. 



Definition 1.10 (Littlewood-Paley projections). Fix, once and for all, a smooth 
function : R 2 — > [0, 1] obeying 0(0 = 1 for |£| < 1 and </>(£) = for |£| > §§. For 
N e 2 Z with N > 1, we define 

PT/(0 = H )/(0 . ' ltN = l (1-13) 

\m/N)-<t>m/N)]f(0 otherwise. 1 ' 

We will typically abbreviate /at :— Pn f ■ 
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2. Basic tools 

2.1. Strichartz inequalities. It will be more convenient for us to recast Klein- 
Gordon as a first-order equation for a complex-valued function via the map 

(u,u t ) v = u + i(V) _1 u t . (2.1) 



This is easily seen to be a bijection between real- valued solutions of (1.1) and 
complex- valued solutions of 

—ivt + (V)u + /i(V) — l£ (Reu) 3 = 0, (2.2) 

with (j, as in (|P]). As such, the local/global theories for the two equations arc 
equivalent. 

Corresponding to Definition 1.1, a solution to (2.2) with initial data v(Q) satisfies 
the Duhamel formula 

v(t) =e- 4 * (v) v(0)^/i(V)- 1 / e~ l( *- s)<v> (Ret;(s)) 3 ds. (2.3) 

Jo 

We will consistently use the letter u to denote solutions to ([O]) and v the corre- 
sponding solution to (^^). Note that the energies and scattering sizes are related 
in the following ways: 



S I (u) = Si(v)= / \Rev(t,x)\ 4 dxdt 
J i Jr 2 

E(u(t)) = E(v(t)) = f ±\(V)v(t,x)\ 2 + %\Rev{t,x)\ 4 dx. 



Consistent with Definition 1.1 , strong solutions of ( |2.2| ) must have finite scattering 
size on compact subsets of the interval of existence. 

The linear propagator associated to this first-order equation is e _l *( v ^. We will 
need to understand how this interacts with scaling. For this reason, we record the 
basic dispersive estimate in the following form: 

Lemma 2.1 (Dispersive estimate). 

|| e -^ t (A-v )pAr/ ||^ (R2) < | <r i (A -i Ar) 2|| / || ii(R2) . (2 . 4) 

Proof. The phase function $(£) = £ • x — \ 2 t(\~ 1 £) obeys 

Hessian($)(£) = detffyflk,*®) = < 2 (A^0~ 4 - 

With this information, the method of stationary phase yields the requisite bounds 
on the integral kernel of e _lA v ^Pn- □ 

Combining this dispersive estimate and the conservation of L 2 in the usual man- 
ner (cf. |l7| and references therein) yields the following: 

Lemma 2.2 (Strichartz inequality). For each 2 < q < oo and 2 < r < oo obeying 
the scaling condition - + ^ = 1. 

II 6 ^ V ^IL t 9 L-(RxR 2 ) ~ II V ) r ^IL=(R 2 )- ( 2 - 5 ) 

Note that the implicit constant is independent of X. 

Using the A = 1 case of this lemma, the form of the free propagator, and 
Duhamel's principle, yields the following: 
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Lemma 2.3 (Strichartz inequality). Let I be a time interval and let u and v be 
solutions to the forced Klein-Gordon equations 

u u -Au + u = F and - iv t + (V)v = (S/)~ 1 G. 

Then, 

ll<V tl!B )ru||£« L r (/xRa) < ||<V t>a! )«(*b)|| i 2 (R3) + || (V) 1 -*^!!^ Lr(/xR2) 

ll<v>^iu ?LS(JxR2) < ||<v>«(to)|| ii(Ra) + 1| (v) 1 -^!!^ Lr(/xR2) 

for any to € I and each 2 < q, q < oo and 2 < r, f < oo obeying the scaling condition 

2 + 2 = 2 2 =L 

q r q r 

2.2. Symmetries. As for the wave equation, the full Poincare group acts as sym- 
metries of our equation. Non-compact symmetries provide a clear obstruction to 
proving the existence of minimal blowup solutions — they provide an easy means 
for minimizing sequences to fail to converge. In view of this, we will need some basic 
information about (as well as notation for) the action of translations and Lorentz 
boosts. As noted in the introduction, even though our equation is not scale invari- 
ant, dilations play an important role in its analysis; thus, we will need to discuss 
these as well. 

2.2.1. Translations. Our notation for translations is 

[T y f](x):=f(x-y). (2.6) 

2.2.2. Lorentz Boosts. We parameterize Lorentz boosts in a manner inspired by 
their action on the Fourier side: Given a frequency parameter v e M 2 , we define 

(t,x) = L„(t,x) := {{v)t - v ■ x,x x + {v)x^ - vt). (2.7) 

Here x 1 - and cc" denote (respectively) the components of x perpendicular and par- 
allel to v. This corresponds to a boost by velocity v/(v), which is to say that the 
observer with coordinates (t, x) perceives the observer with coordinates (t, x) as 
moving with this velocity. An easy computation shows 

L- X (t,x) = {{v)t + v ■ x,x^- + (v)x 11 +vi) =L- V (i,x). (2.8) 

Note that the linear transformation tfijty has determinant one and hence pre- 
serves spacetime volume. 

Lorentz invariance of the linear (or nonlinear) Klein-Gordon equation is precisely 
the fact that u o L^ 1 is a solution if and only if u is a solution. In particular, 

u(t, x) = exp(— i{i)t + i£ ■ x) => u o L z J 1 (t, x) — exp(— i(£)t + i£ ■ x), (2.9) 

where the new frequency parameters are related to the old via 

- L„«0,0 or equivalently, | = 4(0 := ^ + (^)C 11 - HO- (2-10) 

Note in particular that £ = if and only if £ = v, which matches with the fact that 
u represents a wave traveling with velocity £/(£). Note also that o ^ v = Id. 

Associated to the action of Lorentz boosts on solutions to the linear Klein- 
Gordon equation there is a corresponding action on initial data. We denote this by 
the symbol L v which is defined as follows: 

[L u f](x) := [e-( v »/]oL„(0,4 (2.11) 
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By ( |2.8|) , this is equivalent to 

[L;V](z) := [e-W/loI^O.x). 
Note that L„ has been specifically defined so that 



(2.12) 



(2.13) 

In view of ( |2.9D , the action of L„ is easily understood on the Fourier side. In 
particular, we have the following: 

Lemma 2.4 (Action of boosts). Let £ = t v (£), as in ( |2~10| ). Then 

{K l fT(0 = {0{0- 1 f(0- (2-14) 
As a result, boosts do not commute with space/time translations: 

hZ x T y e iT W =T s e if < v >L- 1 w/iere (f.jr) = L u (r,y). (2.15) 
TTie operator L„ is unitary in H^ 2 , that is, 

(l; 1 /, (V)g) L2 =(/, (V)L^) i2 , (2.16) 

6iti noi m general spaces: 



( L v 1 f,g) H s=(f,rn s (V)L l/ g) Hs , with m s (£) = m s (£; v) = 



(0 



2s-l 



However, \\m s 



< Ml 28 - 1 !. 



Proof. From ( |2.12[ ) and (gj) we have 

[L^fKx) = (27T)- 1 / e&f(0 d£ = (27T)- 1 / e**/(0 (f)" 1 ® df. 

JR 2 JR 2 

In the last equality we used that 



f 11 = - and so 

hence (by triangularity) the full Jacobian is 

(0 



(0 <0' 



(0 



, that is, dC = (0 _1 (0^- 



(2.17) 



Next we turn to ( 2.15 ). As Lorentz boosts preserve the Minkowski metric, 

-r(0 + y£ = -f (0 + j£ 

Hence by ( |2~T|) , 

The result now follows after inverting the Fourier transforms. 

1/2 

The inter actio n of h v wi th inner products in H x , or any if| space, follows 
easily from (|tJ) and ( |2.17|) . □ 
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Remark 2.5. The behaviour of the H x norm becomes less mysterious if we consider 
instead the physical quantities of energy and momentum defined in the introduction. 
If u is a solution of the linear Klein-Gordon equation and u = u o L^ 1 , then the 
energy-momentum vectors are related by 

(E,P)=L„(E,P), 

which follows from ( |2.9| ), ( ^.17[) , and Plancherel. This relation also holds in the 
nonlinear case; see Corollary 3.7. 

Remark 2.6. When interpreted in terms of its action on solutions u(t,x) of linear 
Klein-Gordon (as opposed to initial data), the relation ( 2.15| ) takes the form 

u(--r,--y)oL^ 1 =uoL~ l (--f,--y) when (f, y) = L v (t, y), (2.18) 

which is, of course, nothing but the linearity of the transformation L u . 

2.2.3. Scaling. As noted in the introduction, the (nonlinear) Klein-Gordon equa- 
tion does not possess a scaling symmetry; indeed, one of the key themes of this paper 
is how solutions with small Fourier support behave as solutions of the (nonlinear) 
Schrodinger equation. Because shrinking Fourier support is a way an optimizing 
sequence may fail to converge, this is something we need to address in our con- 
centration compactness principle. In this subsection, we merely introduce some 
notation for (L^-preserving) dilation/scaling operators and note how these interact 
with Fourier multipliers, including the free evolution. 

Definition 2.7. For each A £ (0,oo) we define a unitary operator D\ on L x by 

[D x f](x) = \- 1 f(x/X) 

Observe that D\ dilates by a factor A in the sense that the diameter of the support 
of D\f is A times larger than that of /. 

Note that 

m{V)D x f = D x m{\- 1 V)f (2.19) 
for any Fourier multiplier m(V). 

2.3. Useful lemmas. The remainder of this section contains certain manipulations 
of symmetries that we will need in the proof of the inverse Strichartz inequality, 



Theorem 4.9 



Lemma 2.8. Fix h £ L? x and B > 0. Then with m as in Lemma pO , the set 

K := {D^L^moiV^e^Dxh \ \v\<B and B^ 1 < A < oo} 

is precompact in L x . Moreover, the closure of TC does not contain unless h = 0. 
If h is the characteristic function of [—1, l} 2 , then 

supp(ff) C {\(\ < (B)}, \\g\\ Li > (B)-\ and [ \g(x)\ 2 dx < @, (2.20) 



\x\~R 



(R) 



all uniformly for g £ JC. 

Proof. Careful computation shows that 

[D^h-'moiVr^Dxh]^) = h o G(0 (2.21) 

where G(|) = A^ 1 (l/A) - v\ = £ x + (i/)|H + Ai^A^O - 1]. The conclusions of 
the lemma will follow from some basic properties of this function G. 



14 



ROWAN KILLIP, BETSY STOVALL, AND MONICA VISAN 



First we note that G is a bijcction on R 2 ; indeed, the computation ( 2.21 ) reveals 
it to be the composition of dilations, translations (cf. e lux ), and the bijection l~ x 
associated to LjJ 1 . Moreover, the Jacobian of G is 

det[G'] = (v)+\- 1 {\- 1 £)- 1 v-i 
which is uniformly bounded both above and below: 



i-i 



< 



|det[G']l < {v). (2.22) 



(Here we used lA" 1 ^ < (A" 1 !) and \{v) - \v\][{v) + \v\] = 1.) 

From (2.22) we can conclude that h i-» h o G is a uniformly bounded family 
of operators on L 2 for \v\ < B and A £ [B~ l ,oo\. Note the inclusion of A = oo 
here; this is possible since lim^oo G(£) = £- L + {v)£}, which is still a bijection with 
bounded Jacobian. 

To finish the proof of precompactness, it suffices to show that h o G varies con- 
tinuously in L 2 as A and v vary over over the compactified region. By virtue of 
the uniform boundedness of h n- h o G we may safely replace h by an element 
of C^°(K 2 ). With this reduction, the result becomes an easy consequence of the 
continuity of G(£) as a function of A and v and the fact that 

[M-IH]iei<|G(OI<[M + IH]lll, (2-23) 

which follows from |(A -1 ^) — 1| < A _1 |^|, a consequence of the subadditivity of the 
square-root. 

That K. stays away from zero follows immediately from the upper bound in ( 2.22 ). 

Lastly we turn to ( 2.20|). In clusion of the Fourier support follows immediately 
from the lower bound in ( [2.23 ), whi le th e second claim and the case R < 1 of the 
last ineq uality follow directly from ( 2.22 ). To treat the case R > 1, we note that 
by ( 2.21 ), the Fourier transform of a fixed g € JC is the characteristic function of a 
set with piecewise smooth boundary and 

Length(9suppg) < ||(G _1 )'|| L =o (R 2. K 2 X 2 ) < (v). 

Therefore, for each vector \rj\ < 1, 

\x\\g{x)\ 2 ^^ dx = \r 1 \- 1 [ m-v)-g(0\ 2 d£<W). 

\mm J R 2 

The estimate then follows by adding together this estimate for vectors rj of length 
i?" 1 pointing in a fixed collection of directions. (The exact number of vectors 
needed is dictated by the constants in |x| ~ R.) □ 

Lemma 2.9. (a) Suppose g n — g weakly in H\ and X„ — > A 6 (0, oo). Then there 
is a subsequence so that 

[e^ A » i<A » 1V) ffn](j)) -> [e-^ 2 ^ 1 ^ g]{x) for almost every (t,x) G R x R 2 . 

(b) For X n — > A e (0, oo) and fixed g £ H x , 



iKt(X-^) g _ e -iXH(X-W) g ll 



(c) Fix 9 € (0, |) and suppose g n g weakly in L x and X„ — > 00. Then there is a 
subsequence so that 

[e- iX « t[ ( X " lv) - 1] P<x£g n ](x) -> [e ltA/2 g]{x) for almost every (t,x) £ R x R 2 . 
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(d) For A„ — > oo, 9 e (0, \), and fixed g G L 2 , 

|| e -^'[(A-v>-i]p £A ^ _ e «A/a fl || i{ _> o. 

Proof. With regard to almost everywhere convergence of a subsequence, it suffices 
(via Cantor's diagonal argument) to work on a generic cube, say, (t,x) € [— L, L] 3 . 
This in turn can be deduced from L 2 x convergence of a larger subsequence there. 

Consider part (a). First we show the existence of an almost everywhere con- 
vergent subsequence; only after that will we identify the limit. By the Strichartz 
inequality Lemma |2.2| , 

hmsuplKS^^V)^-^ 4 ^ 1 ^.^!!^ < A \\g\\ Bl . 

Combining this with Rellich's Theorem, specifically, compactness of the embed- 
ding W 1 ^ 4 ' 4 (M. 3 ) L 2 {[— L,L] 3 ), we obtain an L\ x (and thence a.e.) convergent 
subsequence on this cube. 

The fact that we have local convergence in L\ x also allows us to identify the 
limit: for all F e C~(tt x K 2 ), 



lim / / F{t,x)[e^ lX " t{x ^ v) g n ](x)dxdt 



n— >oo 



lim / g n (x) / [e a ''< A " v ^F(t, -)]{x)dtdx 



g{x) / [e^ 2t ^-^)F{t,-)]{x)dtdx 
F(t,x)[e- lxH ^ lw) g]{x)dx dt. 



The proof of (b) is easily adapted from the proof of (d) which we give below. 
We now turn to the more subtle part (c) where A n — > oo. By noting that 

A 2 t[(A- 1 0-l] = ^|e| 2 + O(tA- 2 |^| 4 ) as A„->co, (2.24) 



we deduce that for 9 < \, 

|| e -iA>t[<^-V)-l] J , _ c «A /ajP< || _^ Q _ 

Thus it suffices to prove convergence of a subsequence of e ltA ^ 2 P<\e g n on our 
generic cube [— L,L] 3 . The key to doing so is the well-known local smoothing 
estimate for the Schrodinger equation: 

| [(V)^ tA / 2 /] {xfdxdt < L ■ ||/||1 2(R2) ; (2.25) 

-L.L] 2 

see [|l0[ |6[ |7). This estimate implies 

\\(d t )H^e^l 2 g n \\ LlA[ _ L L]3) < L 1 / 2 ■ \\g n \\ L l^) (2-26) 

and hence by Rellich's Theorem, the existence of an L 2 x convergent subsequence on 
[-L, L} 3 . The identification of the limit follows by testing against FeC™(lx R 2 ), 
as above. 

Lastly, we address part (d). By the Strichartz inequality Lemma 2.2 and its 
analogue for e ltA / 2 , it suffices to treat the case when g is a Schwartz function. 
(Note the importance of uniformity in A in Lemma 2.2.) Next we note that by 
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the (uniform in A) dispersive estimate (2.4) and its analogue for the Schrodinger 
propagator, 

\\ e -^lt[(\- 1 v)-i] p .11 , II it A/2 II <T-i|loll 4/, 



Thus we are left to control the region |t| < T. First we note that 



| e -iA= t [<A-V>-l] p<Ae9 _ ( 



itA/2 



\L<*>L%{\t\<T) 



< 



(A„ 2 T + A n 



\9\\m 



(2.27) 



which follows from (2.24) and the fact that 



\P- 



A?,5 



L?Ll(\t\<T) 



< A, 



-49 1 



9\\Ht- 



On the other hand, by the Strichartz estimates and Sobolev embedding, 



-iA^KA-'v}-!] p 



<a«5|| l?L 6 + ||e 



itA/2 



< 



(2.28) 



Interpolating between (2.27) and (2.28), we obtain 



hm lle-^K^-Hp. 



n— too 1 

for each fixed T. 



<a«9 



JtA/2, 



\t\<T) 



□ 



The significance of this lemma for us is that it provides the input for the following 
variant of Fatou's lemma due to Brezis and Lieb (see also ]27], Theorem 1.9]): 

Lemma 2.10 (Refined Fatou, Q). Let d > 1 and 1 < p < oo and suppose {F n } C 
L p (R d ) with limsup ||P n ||p < oo. If F„ — > F almost everywhere, then 

\F n \P ~ \F n - F\p - \F\r\dx ^ 0. 

In particular, if G n — > F in IP sense, then 

Mmsup\\F n -G n \\ L p <limsup (\\F n f LP - \\F\\ P LP ) 1/P . (2.29) 

2.4. Elliptic estimates. In this subsection, we first record a special case of the 
sharp Gagliardo-Nirenberg inequality of Weinstein [48 and then discuss some con- 
sequences for our equation in the focusing setting. 



Theorem 2.11 (Sharp Gagliardo-Nirenberg, |@). For all f € H^(R 2 ), 

\\f\\u<nQ\\-j\\f\\h\\vf\\h- 



(2.30) 



Here Q denotes the unique positi ve radial Schwartz solution to AQ + Q 3 = Q. 
Moreover, equality holds in ( |2.30 ) if and only if f{x) = aQ(\(x — xq)) for some 
a e C, A e (0, oo), and x € R 2 . 

It is not difficult to prove the existence of Q, for example by variational methods. 
A proof of uniqueness (along with earlier references) can be found in Kwong [^6| . 
Integrating the equation for Q against Q and x ■ VQ yields 

E(Q)~ [ i|Q| 2 + i|VQ| 2 ~ i|Q| 4 dx = / \\Q\ 2 =:\M{Q). (2.31) 

Js. 2 Jm 2 

This is known as Pohozaev's identity. 
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Proposition 2.12 (Energy coercivity). Let u : IxM 2 — > K be a solution to (IT) in 
the focusing case with initial data (u(0), ut(0)) — (uo,u\) £ LL\ x L 2 whose energy 



E{u) = E{u ,ui) = 



obeys E{u) < E(Q). 
(i) IfM(u(0)) < M(Q), then 



\u{t)\ 2 

2 

Vu{t)\ 2 



i| Ul | 2 + i|V Mo | 



2 ■ H-' 2 \\u \Ux 



R 2 



\u t (t)\ 2 dx < 2E(u) < M(Q) 
\u t {t)\ 2 dx < 2E(u) < M(Q) 



for all t £ I. As a consequence, 

2E(u) < \\u(t)\\ 2 H i H 
(ii) If M{u{Q)) > M(Q) then 

M(u{t)) > M(Q) ana 

for all t € I . Moreover, 



u t {t)f Ll < 4E(u). 

Vu(t)\ 2 > M{Q) 



d tt M(u(t)) = 2 / \u t \ 



IVul 



l dx > 6 



|w t | 2 dx. 



(2.32) 
(2.33) 

(2.34) 
(2.35) 

(2.36) 



Proof. We begin with part (iV As energy is conserved, E(u(t)) < E(Q) for all 

t G /. Combining this with ( 2.31 ) and the sharp Gagliardo-Nirenberg inequality 
shows that 



M(Q) > 2E{u{t)) > / \u(t)\ 2 + \u t (t)\ 2 > M{u{t)). 



M(u(t))<M(Q) = 

By definition, solutions are continuous in H], x L 2 and so we see that ( 2.32| ) follows 
by a simple bootstrap/continui ty ar gument. 

We now turn to the proof of ( [2.33 ). By the sharp Gagliardo-Nirenberg inequality 
and (^3l]), 



M(Q) > 2E(u(t)) > / \ Ut ( t )\ 2 + \u{t)\ 2 +[l-^§]\Vu\ 2 dx 

and so, neglecting the ut term and doing a little rearranging, we find that 

[M(Q) - M(u(t))] [M(Q) - ||Vu(t)||| s ] > 0. (2.37) 

From (|3|), we see that M(u(t)) < M(Q) throughout the interval of existence. 
Thus ||Vu(£)|| 2 2 < M(Q) and so, by the sharp Gagliardo-Nirenberg inequality, 



\u(t,x)\ 4 dx < \ \ \u{t,x)\ 2 dx 



The estimate (2.33) now follows directly from the definition of energy. 

The proof of part (ii) closely parallels that of part (i). The first inequality in 
( [2.35| ) follows from a simple bootstrap argument based on the fact that if M(u{t)) = 
M(Q) then by the sharp Gaglia rdo-N irenberg inequality, 2E(u (t)) > M(Q) — 
2E(Q). The second inequality in (2.35) follows from the first via (2.37). 
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The evaluation of duM(u(t)) is an elementary computation. Note that the an- 
swer can be rewritten in the form 

d tt M(u(t)) = -8E(u)+ [ 6|u t | 2 + 2|Vu| 2 + 2|u| 2 , 

,/R 2 

from which ( 2.36| ) follows from 2E(u) < M(Q) and both inequalities in ( 2.35 ). □ 

3. Local theory 

By using L\ x and L^H^, as the basic spaces in a contraction mapping argument, 
the Strichartz estimates directly yield local well-posedness, persistence of regularity, 
and stability results. The arguments leading to local well-posedness can be found 
in any textbook on dispersive PDE, for example, 0. A proof of the persistence of 
regularity result may be adapted from the proof for NLS given in 43, Lemma 3.10]. 
We formulate these basic statements in the context of the first-order equation ( |2.2[ ); 
the reader should have no difficulty reformulating them for solutions to 

Proposition 3.1 (Local well-posedness for Ft 1 initial data). Let v g H^(M. 2 ). 
Then there exists a unique maximal-lifespan (strong) solution v : / x M 2 — > C to 
( p.2| ) with v(0) = vq. Furthermore, the following hold: 

• (Blowup alternative) IfT = sup/ is finite, then \\v{t)\\^i — >• oo as t — > T . 

• (Conservation laws) The energy and momentum are finite and constant in time. 

• (Scattering) If v does not blow up forward in time, that is, if S[o j00 ) (v) < oo, 
then there exists v + £ H^iM 2 ) such that 

lim \\v(t)-e- H ^v+\\ Him =0. (3.1) 



t— >oo 



Furthermore, for each v + £ H^(M. 2 ), there exists a unique v which solves (2.2) in 
a neighbourhood of +oo and satisfies (3.1). In either case, 

e(v) = ilKll!t (R2) . (3.2) 

Similar statements hold backward in time. 

• (Small data result) If \\vo\Ih 1 * s sufficiently small, then v is global and moreover, 

Sr(v) < E{v) 2 . 

• (Small solution to LKG implies small solution to NLKG) If I is an interval, 
G I, and \\ Ree _I ^ v ^uo||i<i (/ x r 2 ) * s sufficiently small, then I is contained in the 
lifespan of v and 

Si(v)<\\Ree-^v \\i UlxR2) . 

• (Persistence of regularity) Let I C R and assume that Si(v) < L. Given s > 0, 

ll(V) S+ ^|| i?L . (/xR2) <L,s, g ,r ||(V) s+1 « ||l 2 ( R 2 ) (3.3) 
for each q and r obeying 2 < q < oo and ^ + 7 = 5- 

Remark 3.2. Note that in the small-data setting, the Gagliardo-Nirenberg inequal- 
ity shows that ~ E(v). As a consequence, Hwllioo^-i < ||wo||ffi- 

In the defocusing case, energy controls ||u (t)!!^ and so finite-time blowup cannot 
occur. In the focusing case, the energy is no longer coercive in general; nevertheless, 
by part (i) of Proposition 2.12 we do obtain the following: 
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Corollary 3.3 (Global well-posednes s). In the defocusing case, any initial data 
vq G H\. leads to a global solution to (2.2). In the focusing case, any initial data 
vq G H\ obeying 

/ \Rev Q \ 2 dx< / Q 2 dx and E(v ) < E(Q) 
Jr 2 Jr 2 

leads to a global solution. Here Q denotes the ground state, as in Theorem 2.11. 



In view of this corollary, the main objective of this paper is to prove that these 
global solutions obey spacetime bounds, which, by the local theory mentioned 
above, will also imply scattering. As described in the introduction, we will ar- 



gue by contradiction, showing that failure of Theorem 1.4 implies the existence of 
minimal-energy counterexamples (see Section Q). A key ingredient is the follow- 
ing stability theory. The significance of stability theory has really only come to 
the fore with the investigation of scaling-critical equations. The archetypal argu- 
ment appears in Q ; see also [^2| . A detailed proof for the mass-critical nonlinear 
Schrodinger equation is given in p3[ , and only minor modifications are required to 



extend that argument to the first-order nonlinear Klein-Gordon (2.2) 



Proposition 3.4 (St ability theory). Let I be an interval and let v be an approxi- 



mate solution to (2.2) on I in the sense that 

-w t + (V)w + At(V) _1 (Re{i) 3 + ei + e2 = 0, 

with small error terms e\ and ei- Assume that 

IK V ) 1/2{) IL~L2 (/><R 2 ) < M and || ReS|| L 4 x(/xR2) < L 

for some positive constants M and L. Let to € I and let vq satisfy the condition 

\\(V) 1/2 (v -v(to))\\ Li(m2) <M' 

for some positive constant M' . Then if < e < E\{L,M,M') and if vq and the 
error terms satisfy 

\\e-^-^) {vo . v{to ))\\ LUlxR2) <e, 

IK V ) e lHi*4 3 (JxR3) + W e ^LlHl /2 (IxR 2 ) - £ ' 

then there exists a solution v to ( |2.2| ) with initial data vo at time t = to. Further- 
more, the solution v satisfies 

\\v-v\\ LUlxm < £ C(M,M',L) 

\\v-v\\ L? ^ IX1P) <M'C(M,M',L). 

The local theory described so far treats time as an absolute, which jars with 
the Lorentz invariance of our equation. Moreover, it does not allow us to consider 
boosted solutions uo L v , even for small values of v. The next lemma remedies this 
by proving local existence in a larger spacetime region; we then make some basic 



observations about the behaviour of the boosted solutions in Corollary 3.7 



Lemma 3.5 (Boostable local solutions). Given initial data (uq,Ux) £ if* x I? x , 
there is an e > and a local solution u to ( |l.l| ) matching this data {at t = 0) and 
defined in the spacetime region fl — {(£, x) : \t\ — e\x\ < e}. Moreover, 

\\ u \\L q t Li(n) ■= Wxnu\\L" t L-(RxR 2 ) < oo for each 2 < q < oo and ± + ± = \, 
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lr(H^xL 2 )(U) '■= SU P / Xn(t,x)[\u t (t,x)\ 2 + \Vu(t,x)\ 2 + \u(t,x)\ 2 ] dx < oo, 



lim sup / [|u t (t,a;)| 2 + \Vu(t,x)\ 2 + \u(t,x)\ 2 ] dx = 0. (3.4) 

R ^°° \t\<sRJ\x\>R 

The solution u with these properties is unique. 

Remark 3.6. As we will see from the proof, if (uq,ui) leads to a global solution, 
then we may take any < s < 1. 

Proof. Both existence and uniqueness follow directly by combining the local theory 
desc ribe d so far with finite speed of propagation. First we note that by Proposi- 
tion^^ there exists Tb = Tq(uo, u%) > so that there is a (unique) local solution u 
to ( |l . l[ ) defined on the spacetime slab \t\ < To and having finite spacetime norms 
there. 

Next, let (f> denote a smooth cutoff function with <p(x) = 1 outside the unit ball 
and <f>(x) = when |x| < i. Given any r\ > (in particular, the threshold for the 
small data theory), there is an R sufficiently large so that 

/ [|^(a;/i2o)«i(aj)| 2 + \V[<j}(x/R Q )u (x)}\ 2 + \</>(x/Ro)u {x)\ 2 ] dx < r). 

JR 2 



Thus by Proposition p.l[ there is a global solution u to (1.1) with initial data 
u(0,x) = 4>{x / Rq)uq{x) and iit(0,x) = <j){x / Rq)ux(x) . By uniqueness and finite 
speed of propagation, u provides an extension of u to the spacetime region where 
|x| — |t| > i?o- Note that u has finite (global) spacetime norms. 

To recap, we have proved that there is a unique local solution with finite space- 
time bounds on the region where \t\ < Tq or \x\ — |t| > R<j. This includes the 
region il provided we choose e < T /(l + R +T ) and so settles the majority of 



the lemma; it remains only to prove the tightness statement (3.4). 



The proof of (3.4) is a simple variation on the u construction above. Indeed, 
if u( RS) is the solution to (|l.l|) with initial data u^ R \0, x) — 4>(x / R)uo(x) and 
Ut (0, x) — 4>(x / R)ui(x) then (cf. Remark 3.2) 



lim Hu^IUoojj^MxK 2 ) + l|9 f M (/?) || L ~L2(RxR 2 ) = 0. 
R— >oc 



Taking R < (1 — e)R, this proves (3.4) because u and agree on the region 



\x\ - \t\ > R. □ 



Corollary 3.7. In view of Lemma 3.5, any initial data u(0) € H\ and Ut(0) G L 



2 



lead to a solution u to (1.1) in a spacetime region of the form 57 = {(t, x) : \t\—s\x\ < 



1-2 



e} for some e > 0. For~^ < e, we have 

(i) uoL u (t,x) is a (strong) solution to ( |l.l| ) on (— e,e) 

(ii) v i-> (u o L v (0, x), [u o L„] t (0, x)) is continuous with values in H^. x L 2 . 

(iii) The energy and momentum obey Einstein's relation: 

{E{uoL v ),P{uoL v )) =L- 1 (E(u),P(u)). (3.5) 

In particular, 

E(u o L u (tf) 2 — P(u o L u (t)) 2 is independent of t and v . (3-6) 
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Remark 3.8. The square- root of the quantity in (3.6) is usually known as the rest 
mass; compare the famous equation E 2 = P 2 c 2 +m 2 c 4: , or its P = case E = mc 2 . 
We will not use the term rest mass here so as to avoid confusion with the (non- 
conserved) quantity M(u(t)). 

Proof. It is not difficult to verify that if < e then L v maps (— e,e) x M 2 

into O. Thus u o L v is defined in the region claimed. As L v is volume preserving, 
we also see that 



uoL v {t,x)\ dx dt < J J \u(t, x)\ dx dt < oo. (3-7) 

This estimate allows one to justify the elementary manipulations which guarantee 
that u o L v is a distributional solution. To prove that it is a strong solution we need 
to prove that it belongs to C$(H* x L 2 ). We will settle both this and part (ii) of 
the corollary by showing that 

(t, v) H> (u o L u (t, x), [u o L u ] t (t, x)) 

is a continuous function from {\t\ < e, \v\ < e} to H% x L 2 . 

Let u hn denote the solution to the linear Klein-Gordon equation that has the 
same initial data as u, namely, 

u lin (t) = cos(t{V))«(0) + (V)" 1 sin(t(V)K(0), 

and let u = u — u hn denote the difference. The action of Lorentz boosts on solutions 
of the linear equation was described in subsection 2.2.2| ; in particular, 

u lhl o L v (t, x) + ^(V)- 1 a t [^^ lin o L v ](t,x) = e^ t{v) U[u(0) + t{V) _1 «t(0)]. (3.8) 

The action of e~ %t ^h v on the Fourier side (cf. Lemma |2.4| ) clearly shows that 

(t, v) ^ (u lin o L v {t, x), [u Un o L u ] t (t, x)) 

has the required continuity. This leaves us to consider the effect of Lorentz boosts 
on u, which obeys 

u u — Au + u = — /j,u 3 and w(0, x) — u t (0, x) = 0. 

As u — u — u hn , Lemma 3.5 and the Strichartz inequality imply 

NU?£j(n) + ||Vt,x«IUf>i|(n) < oo for each 2 < q < oo and | + ± = \. (3.9) 

We also have 



lim sup / [\u t (t,x)\ 2 + |Vw(t,x)| 2 + \u{t,x)\ 2 ] dx = 0. (3.10) 

R ^°° \t\<eRJ\x\>R 



Again this follows by writing u = u — u : For u we use (3.4); the analogous 
estimate for u lm follows from finite speed of propagation and energy conservation 
(cf. the proof of Q). 

We now turn to the main part of the argument. We will give complete details 
for the proof that uoL v and its time derivative are bounded in the requisite spaces 
and that continuity holds at the point (t = 0, v = 0). The reader should have little 
difficulty adapting the argument to prove continuity at other points, for example, 
by using the group property of the transformations. 

Let 7" denote the stress-energy tensor for u, which has components 

r 00 = in 2 + i|v«| 2 + i|u| 2 , r 0j = P° = -u t u h 

and T jk = UjU k - Sj k [T 00 - |ttt| 2 ] , 
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where j, k € {1,2}. Note that thi s is t he stress-energy tensor associated to the 
linear Klein-Gordon equation (cf. (3.13) below). For our immediate purposes, we 
do not need to consider the full tensor, but merely the 3- vector p with components 

p« : = ( v )T 0a + Vl T la + v 2 T 2a , a e {0, 1, 2}. 

This vector has divergence 

V t ,s • p = d t p° + dip 1 + d 2 p 2 = -/J,u 3 [(v)u t - v ■ Vtt], 

and was deliberately constructed so that 

p ■ dS = / [(z/)p° + Ujp j ] o L v {t, x) dx 



L„(t,R 2 ) 



\d t (u o L v )\ 2 + |V(fi o L v )\ 2 + |u o L v \ 2 dx, 



where dS denotes surface measure times the unit normal. 

Both u and Vt^u vanish on the surface t — 0. Thus we may estimate the 
required norm by applying the divergence theorem to p on the region 

Qt,u ■= {(*, y):0<s< {v)-\t — v ■ y)} U {(«, y) : {u)-\t - v ■ y) < s < 0} C fi, 

whose boundary comprises (0,R 2 ) U -L„(i,R 2 ). There are two technical obstacles 
to doing this: p may not be smooth enough and Qt,^ is not compact. The former 



can be dealt with by the usual mollification technic 



function. The latter was the reason for proving ( 3.10 ), as we will explain 



ue of convolving with a C° 



The main estimate required to prove boundedness and continuity is the following: 



|V t , x -p\dyds < (v) 



fit, 



\u(s,y)\ 3 \Vt, y u(s,y)\ dyds 



(3.11) 



fit ,t> 



^ ||«IU|Lg(n t ,„)||Vi >x i2||io OL a(n) ->• as (t,z/)-»Q. 



The last step follows from Lemma 3.5, (3.9), and the dominated convergence theo- 
rem since fi^j, — > as (t, v) — > 0. 

We are now ready to apply the divergence theorem. Let <j> : R —> [0, 1] be a 
smooth function with <f>(r) = 1 when r < 1 and 4>(r) = when r > 2. Now let 
ipn(s,y) = <f>( ft ) where R > will be sent to infinity. Applying the divergence 
theorem to ptpR and invoking ( 3.1Cl| ) and ( 3.11 ) yields 



\d t (u o L v )\ 2 + |V(« o L v )\ 2 + \u o L v \ 2 dx 



lim ■ 

R— ¥00 



[\d t {u o L v )\ 2 + |V(« o L,)| 2 + |u o L,| 2 ] 

< limsup / / \ipRV t , x • p| + |p ■ Vt^Vfll dyds 

< [[ I Vt,x -Pi + limsup i / / |(Vt, B )u| 2 da?tfc 

J Jn t:V R^oo ttJ-eR.J\x\~R 



< 



I Vt.ar ♦ Pi »• as (t,I/)-»0. 



(3.12) 



fit,! 



This settles parts (i) and (ii) of the corollary. 
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The proof of part (iii) revolves around the stress-energy tensor associated to the 
nonlinear Klein-Gordon equation: 



j-OQ _ 



2"t 



2-M 2 



fM 4 , 



U t Uj, 



00 



and 7~ J = UjUk — 8jk [T' 
As u is a solution, this is divergence free, that is, 



(3.13) 



d t T a0 + diT al + d 2 T a2 = for all a G {0,1, 2}. 



Applying the divergence theorem for all values of a, one deduces that (3.5), and 
hence (3.6), holds. The estimates needed to deal with the non-compactness of fl tM 
can be found in Lemma [T^. □ 

Corollary 3.9 (Boosting to zero momentum). Let (uq,ui) G H\ x L\. In the 

focusing case assume also that M(u(0)) < M(Q) and E(u) < E(Q). Let u : 
R X R 2 — > R be the solution to (1.1) with this initial data. Then there exists v G R 2 
such that u u := uo L v is a global (strong) solution to (1.1) with 

P{u v ) = 0, (3.14) 
E(u") < E{u), (3.15) 
M(u u (Q)) < M(Q) in the focusing case. (3.16) 

Remark 3.10. In view of ( |3.5| ), if u is not identically zero then there is at most one 
value of v (cf. ( |3.18| )) so that P(u v ) = 0. Moreover, by (|J) we have E{u u ) = E(u) 
if and only if P{u u ) = P(u) = 0. 

Proof. Without loss of generality, we may assume that P{u) ^ 0, the result being 
trivial otherwise. In particular, this means that u(t) ^ for all t. 

By Remark 3.6, for every to G R, the function u(- — to) satisfies the conclusions 
of Lemma 3.5 for every < e < 1. Thus by Corollary |3.7|, u v is a global strong 



(3.17) 



solution to (1.1) for each v G R 2 . 

By d3.5|), it is possible to find v such that P{u v ) — if and only if 



\P{u)\<E{u). 
Moreover, in this case, one must choose 

P(u) 



To see that Q3.17 ) holds in our case we observe that 

\P(u)\ < ||Vu|| iS ||«t|U S < il|V«||ij + iHutHij < E{u). 



(3.18) 



(3.19) 



The first step here is the Cauchy-Schwarz inequality; note that equality would 
imply that Vu(£) points in only one direction for all x G R, which is inconsistent 
with u(t) ^ 0. The last inequality in ( p. 19 ) rests solely on the definition of E(u) in 
the defocusing case and is an application of ( 2.33j) in the focusing case. 
The bound ( pT5| ) is a trivial consequence of ( |3.6[ ). Indeed, we have 



E(u sv ) < E(u) for all < s < 1. 
This follows from the fact that 

E(u s ") = {sv)E{u) + sv ■ P(u), 



(3.20) 
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which is convex in s and is bounded by E{u) at both and 1. By the sharp 
Gagliardo-Nirenberg inequality (cf. the proof of fl2.32|) ), M(u su (0)) < M(Q) im- 
plies that 

M(u sv (0)) < 2E{u s ») < 2E(u) < M(Q). 



Thus by the continuity established in Corollary 3/7, we must have M( u su (0 )) < 
M{Q) for each < s < 1, and in particular at s = 1. This settles ( 3.1 6| ) and 
completes the proof of the corollary. □ 



4. Refinements of the Strichartz inequality 



The goal of this section is to prove an inverse Strichartz inequality (Theorem 4.9), 
which is an essential ingredient in the concentration compactness argument. As 
described in the introduction, we will use Tao's sharp bilinear restriction estimate; 
sharpness here refers to the minimality of the spacetime integrability exponent. 
This sharpness is not important for our purposes; what is important is that it 
provides exponents beyond the range of the Strichartz inequality. 

To apply Tao's estimate, we need to reduce to a fixed compact set in Fourier 



space. This is the role of Lemmas 4T and L5. The former lemma demonstrates 
that a linear evolution cannot be big without a significant contribution from one 
of its Littlewood-Paley pieces. The proof employs the same ideas as proofs of the 
inverse Sobolev inequalities needed in energy-critical cases; however, the precise 
formulation and argument reflects the authors' particular perspective, as advertised 
in !!• 

In the L^-critical case (as opposed to the L^-supercritical regime), the charac- 
teristic frequency scale of a minimal blowup solution can be arbitrarily small. Also, 
to a first approximation, Lorentz boosts act as translations on the Fourier side (cf. 



Lemma 2.4); thus we see that the characteristic length scale of a wave packet is 
not indicated by the Littlewood-Paley annulus to which it belongs. We capture 
the dominant portion of the boost parameter by subdividing annuli into tubes of 
unit width; these tubes are natural since they are images of the unit cube under 
the action of i y . The proof that a large linear evolution can be attributed to some 



tube rests on the bilinear Strichartz inequality Lemma 4.5 

1/2 

Lemma 4.1 (Annular decoupling). For f € H x ' , 



|e- lt<V) /||^ <sup||e-*« v >iW|| r4 ||/ 

1 "^t.x JV 



Pro of. U sing the Littlewood-Paley square function estimate, the Strichartz inequal- 
ity ( p.q ), and Bernstein's inequality, 

lle-^/H 4 ^ 



<£ / / | e -* t<v> /M|V 4 ' (v> ^r^ 

M,N ® * 2 

< V lle~ 4 * (v) Aril lle~ 4 * (v) f*A\ lle~ 4 * (v) fJI lle~ 2t(v) fJI 

M<N 

< sup||e~ lt<v> /K||^ 4 ^ \\fM\\ H V3\\fN\\ H i 



.1/3 



M<N 
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<- II -i*<v>, |,3 (Af) 1 / 6 , 
<sup||e 'Jk\\ l * x 2^ ( N \i/6 \\JM\\ H i/2\\jN\\ H i/2 

K ' M<N » ' 



^suplle- 14 ^/^!!^ ||/||^i/ 2 . 
The last step is an application of Schur's test. □ 



As noted, this lemma shows that if the Lf x norm of the free evolution of / is large, 
then one of its Littlewood-Paley pieces must take responsibility for this. To find the 
wave packets inside the evolution of / that are responsible, we need to subdivide 
each dyadic annulus into tubes. More accurately, they are strips or sectors in the 
two-dimensional case we are discussing; nevertheless, we use vocabulary adapted 
to the case of arbitrary dimension. 

Definition 4.2. Given N G 2 Z with N > 1, we (almost-everywhere) cover the 
Fourier support of Pn by a finite collection 7jv of non-overlapping tubes. When 
N = 1, the collection consists of just one element, [— 1|, §§] 2 - F° r A > 2, we choose 
Tn ■= {T N : < k < 20N} with 

T" N := {£ : \N < \£\ < §A and | Arg(£) - f| | < 

where Arg(£) is the angle between £ and the positive horizontal axis. Given a tube 
T G T '■= UatTzv, we define the center as follows: c(T) — if T is the unique T G 7i 
and c(T&) is defined by \c(T$)\ = 3A/4 and Arg(c(T^)) = 

Associated to each T G Tn, we define a Fourier restriction operator Pp. To 
this end, let ^ : R — > [0, 1] be smooth, obey tp(0) — 1 when \0\ < ^j, supp(^) C 

[~ To> IT)]' ana - f° rm a partition of unity via X^fcez VK^ — = 1 f° r au 6* G R. Now 
define 

except in the special case N = 1. In this latter case, T = [— 1|, ||] 2 and 

MO :=iW(0 :=^7(0- 

Remark 4.3. Careful but mundane computation shows that the Lorentz transfor- 
mation with parameter ^ = c (^) maps the tube T into the ball |£| < 2. More 
precisely, the Fourier support of L~ /t lies inside this ball. 

The principal significance of this remark is that it allows us to prove the following, 
which will be used in the proof of Theorem [I.9| . 

Lemma 4.4. Given a tube T G Tn ^e£ v = c(T) 6e zis center. Then 
uniformly in T and N . 



Proof. In view of Remark |]§ ( p^| ), and ( gig ), we need to prove boundedness of 



the multiplier m(V) defined by 

m(|) = [</>(£/A) - 0(2C/iV)]^(^ Arg(C) - fc) =: rotf), 

where £ = as in ( 2.1 0| ) . (Minor modifications are needed when N = 1.) 

We will apply the Mikhlin Multiplier Theorem (in the form of |37|, §IV.3.2]); as 
m is supported inside {|£| < 2}, this is perhaps a little excessive. For the single 
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tube when N — 1 (or indeed any finite collection of iV) the result follows easily. 
Our only obligation is to check the uniformity as N — > oo. 
The computations that led to (2.17) show that 



(4.1) 



where the entries of the matrix A(£) obey symbol estimates of order zero uniformly 
in v. On the other hand, direct computation shows 



«^„nV)^m(£) 



<a,<3 1 



The result then follows by combining these estimates via standard symbol manip- 
ulations. □ 

Lemma 4.5 (Bilinear Strichartz). Fix N > 1 and let T\,Ti E Tn ■ Suppose f,g £ 
L^iM. 2 ) obey supp / C T\ and supp g C T 2 - Then 

N 



i e -«(v>y e -«(v> 5 | 



< 



(dist(Ti,T 2 )> 



I/IUhIIsIUh- 



(4.2) 



Proof. By the Strichartz inequality Lemma 2.2, we need only consider the case 
when dist(Ti,T2) > 100. Moreover, by rotation symmetry we may assume that T\ 
lies along the £i axis. 

Given a generic F e L 2 (i x I 2 ), let 

/ := // F(t,x)[e- U{v) f]{x)[e- u{v) g](x)dxdt 

R 2 

F{-{0-{r)),Z + v)f(Og(v)d{;dr ] . 

Next we change variables according to uj = — (£) — (rf), ( = £ + i], and /3 = £2- On 
the support of the integrand, the Jacobian satisfies 



J' 1 



1 _ 






6 










(0 


(v) 



> 



(dist(Ti,T 2 )) ; 
N 2 



and j3 varies over an interval of length O(l). Using this information and the Cauchy- 
Schwarz inequality, 



\I\< 

<II^IIl 2 
<\\f\\l> 



F(u, 0/(f(w, C, P))g(v(u, C, P))Jdu d( dp 



/«(w,C,/9)) 5(»?(w,C,/3)) \J J dudC) d(3 



c, P))\ \g(r)(u, P))\ j2 duj d ( d P 



< \\F\ 



L 2 



1/(01 \9(V)\ Jdtdr) 



S\\m»\\J\\%\\f\\vMi*> 

The inequality now follows from duality and our bound on J. 



□ 
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Corollary 4.6 (Tube decoupling). Given a dyadic N > 1 and an f € L^(IR ), 

|| e -*(v) Pjv/ ||4 <iv5/2 sup || e -«(v>p T/ || || Pjv/ ||3 2 (43) 

where Tn and Pr are as in Definition 

Proof. By Holder's inequality, the fact that X^tsTn ^" >t = anc ^ then Lemma 4.5 



|e-«< v >JW 



< y || e -«<V> j p r/ ||l/4 || e -«<V>p T// ||V4 || e - it< V>p T/e - it <V>p T , / ||3/4 
T,T' 



T"£Ti 



T,T 



• t (dist(T,T')) 3 / 4 ' 



Next, by applying the Hardy-Littlewood-Sobolev inequality in the sum and then 
the orthogonality of all but adjacent /t, we get 



\e-^P N ff Li <N^ sup \\e- lt Wp Tll f\\^ fe\\PTf\\% 5 ) 



5/4 



<7V 5 / 4 sup ||e-^P T ,/||^ 4 2 llP^v/IlK 2 , 
T'' G r„ Lt - x L * 

which yields the claim. □ 



Theorem 4.7 (Bilinear Restriction, 40 ). Let f,g e L^(R 2 ) /icrae Fourier support 
in the region |£| < 4 and suppose that for some c > 0, 

M := dist(supp /, suppg) > cmax{diam(supp /), diam(suppg)}. 

Then for q > | , 

||[ e -^V> /][e -it(V> 5] ||^< cM 2-|| |/||iS || 5 | Ui _ 

Proof. While the main result in jh)| is stated for the Schrddinger propagator, the 
discussion in Section 9 of that paper explains how it extends to compact surfaces 
whose principal curvatures are strictly positive, in our case, {((£),£) : |£l < 4}. □ 

This bilinear estimate can be used to obtain a form of refined Strichartz inequal- 
ity showing that if the free evolution is large, then the Fourier transform of the 
initial data must concentrate on some cube. This idea was first developed in [ p9| 
in the 2D Schrddinger setting. By modifying their arguments (and those of H for 
higher dimensions) it was shown in §4.4] that the free evolution of this bubble of 
Fourier concentration must have nontrivial spacetime norm. These arguments ap- 
ply equally well to the Klein-Gordon setting once one has the appropriate bilinear 
estimate, Theorem 4.7. In this way, we obtain the following: 

Corollary 4.8 (Cube decoupling). For f £ Ll(R 2 ) with supp/ C {|f | < 4} ; 



< 



Here the supremum is taken over all dyadic cubes Q and fQ = Pq$ denotes the 
Fourier restriction of f to Q. Without loss of generality, we may require the cube 
Q to have side-length no more than eight. 
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Theorem 4.9 (Inverse Strichartz). Let {/„} C H^(R 2 ) and suppose that 

lim ll/n||i/i(R 2 ) = A md lim ll e ~ rf<V> /n||Lf (RxR 2 )= e >°- 

Then there exist a subsequence in n, cj) £ L 2 (R 2 ) 7 {A„} C [~,oo), {v n } Q 
{(t n ,x n )} C R x R 2 so that we have the following: 

A n — > Aqo 6 [|,oo] and v n — > ^ e M 2 

Aqo < oo e * . 

TTie functions f n in the subsequence contain a nontrivial wave packet 

= (T^e^WL^Dx^ if Xoo < oo 

[T^e^^h^Dx^KxBj if Aoo = oo 

(with power 8 — j^) in the sense that 

lim \\fn\\m - II/" ~ ^nWm - \\(/>n\\m = °) 



lim inf 

n— ► oo 

limsup||e- l *< v >(/„-0„)L4 



txR 2 ) 



< e 



1/4 



and lastly, 



L>^ 1 L; 1 T7 1 e- it " (v> / n weafc/y in L 2 X (I 



(4.4) 
E 2 , and 

(4.5) 
(4.6) 

(4.7) 

(4.8) 
(4.9) 
(4.10) 

(4.11) 



Here c and C are constants and all limits as n —¥ oo are along the subsequence. 

Proof. The final subsequence appearing in the conclusion of the theorem is the 
result of passing to subsequences on several successive occasions. For the sake 
of simplicity /clarity, we do not attempt to represent this with any notation; any 
n — > oo limit i s un derstood to be along the subsequence extracted at that point. 
By Lemma 4.1 there are dyadic N n so that 



\e-^)p N J n 



> e 2 A~ 



Further decomposing this frequency shell into tubes and applying Corollary |4.6| we 
deduce the existence of tubes T n G T/v„ so that 

Iimmflle-^P^/nlL £ s 8 A- 7 N^ 2 > e 8 ^ 7 . 

n— ►oo n ^t,x 

Incidentally, since the LHS here is < A by virtue of the Strichartz inequality, we 
may infer that N n < (A/e) 16 . 

Next, to bring ourselves into the setting of Corollary 4.S, we apply a Lorentz 
boost with parameter v n = c(T n ) to transport T n to the origin. As Lorentz boosts 
preserve the Lf x norm (they are spacetime volume preserving) we have 

limmflle-^Lr^/JI > e 8 ^ 7 



and, as noted in Remark h3, the Fourier support of L^ 1 Pr n f is contained inside 
the ball |£| < 2. This remark allowed us to prove Lemma 4.4, which we now apply 
to obtain 



lim inf 1 1 e 

n— >oo 
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and thence, via Corollary 4.8, the existence of a dyadic cube Q n , so that 

II -it(V)p p j-lf || , > _32 4-31 \-3/ll 
\\ e rQ n r<2L J n n/2 £ £ A A , 



(4.12) 



where Ajj 1 < 8 denotes the side-length of Q n - We used Lemma 2.4 here to see that 
\\P<2L^fn\\ Ll < ||L^/n|| £; < ||L» B V»|| H y« = ||/»L;/> £ ^ ( 4 -!3) 

As A" 1 € (0,8] we may pass to a subsequence so that A„ converges to some Aoo € 
[1/8, oo]. As we will need this later, we set £ n equal to the center of the cube Q n ; 
these form a bounded sequence and so passing to a further subsequence we may 
assume that they converge to some £oo. 

To continue from ( 4.12| ) we first use L p -boundedness of P<2 to discard this 
operator. Then combining the result with Holder's inequality and the Strichartz 
inequality we obtain 



_32 4-31 \ -3/11 < II _-ti<V)p t -1 f || 8 / n ||„-it(V> p r-li 



1 3/11 



3/11 



< A 8/ll|| e - it (V> p 

which then implies the existence of t n <E R and x n £ M 2 so that 

\P Qn e-*»WL£f n \(x n ) > X-'e^A-^. (4.14) 

We have now isolated all the parameters needed to find our bubble. At present, 
some are adorned with a tilde because they will be replaced later when we reorder 
the symmetries (which do not commute) and because Q n is not centered at the 
origin which n eces sitates an additional Lorentz boost with parameter £„. As in 
( 1.15 ), Lemma 2T implies that 

D xl L iX e ~ ih <V>L ^/n (4-15) 

form a bounded sequence in L x ; indeed, since |£ n | < 1 and \v n \ < N n < (A/e) 16 , the 
sequence will also be bounded in H x if A„ is bounded, that is, if A^ < oo. Hence, 
after passing to a subsequence, we have a weak limit, say, <j> € L%; moreover, <f> G H x 
when Aqo < 00 ■ That this limit has nontrivial norm follows from ( 4.1 4| ) as we will 
now explain. Let h be the characteristic function of [— i, i] 2 and define 

h n := D^ B 1 Lg n 1 m (V)~ 1 e^» 9! I>A n /i (4.16) 



where m (V) = 77!, (V,£ n ) is as in Lemma 2.4. By Lemma 2.8 these functions 
lie in a compact set in l? x and so we may pass to a subsequence along which we 
have strong (=norm) convergence, say h n — ¥ h^. The lemma also implies that 

WKWli < l, so 



j L 2 > lim \(h n , (j)} L 2 



(4.17) 



Next, using the definition of <fi, Lemma 2.4, the unitarity of the other symmetries, 
and finally (4.14), we have 

lim (hn, 4>) L 2 = lim (h n , D^LJ 1 ^ V if »< v >LJ n 7 n )„ 
= lim (T in e l ^D Xn h, e^^Lr 1 /„) 
= lim A n [Pg B e-* f -< v >L^/n] (*») 



^ 352 

> £ 3 ^4 



:l 1!) 
3 



(4.18) 
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Combining this with (4.17) yields non-triviality of i 



i 2 . 



352 j 349 

> e~ A ~ = 



(4.19) 



Next we use ( 2.15 ) to reorder the symmetries on the sequence ( 4.15 ) that con- 
verges (weakly) to cf>: 



-l p -it„(V> 



fn 



(4.20) 



where (—t n , —x n ) — Lo n (—t n , —x n ). In general, the composition of Lorentz boosts 
is not a pure boost but also includes a spatial rotation. We define v n so that 
L^" 1 Lp 1 = R n L~^ for some rotation R n € 50(2). As this is a compact group, we 

may pass to a subsequence so that R n — > R and then define <fi = P _1 0. Rotations 
commute with dilations and preserve the L x and H x norms. Thus 



with 



. 352 349 

> e~ A ~. 



(4.21) 



The weak convergence is in H x when Aoo < oo and merely in L x when Aoo = oo. 
Note that the new sequence of boost parameters v n inherits boundedness from v n 
and £ n ; more precisely, \v n \ < (A/e) 16 . By passing to a furthe r sub sequence , we can 
guarantee convergence of v n as stated in (4.5). In addition, (4.21) settles (4.11). 

Let us now turn to proving decoupling of the H x norm. We begin with (4.9), 
treating only the case = oo since the case A,^ < oo is similar but easier. By 
Lemma 2.4, 

\\<l> n \\ H i = ||L„„Z} Aii P< A «0|| ff i > (v n )- 1 \\Dx n P< K 4>\\ H i > (^>- 1 ||P<a^IU S 
and thence by A„ -t oo, \v n \ < (^4/e) 16 , and Q4.2ip , 

397 

liminf||0„|| H i >{elA) 1& U\\ Ll > (f ) 3 e. 

beginning with the basic 



This yields (4.9) as promised. Next we consider 
Hilbert space identity 

11/nllffi - \\fn - - ||^n||fl-i = 2 (/n - <t>n, <l>n) 

Again we confine our discussion to the more interesting A rl — >■ oo case, where 

(fn ~ <t>n, 4>u)hI = {^x^ e ~ fn - L^„ D\ n P< A s (f), L„ n D\ n P< A e 4>) H i 

= (L^T-^e-^Wfn - D Xn P< x eJ, mi (V-,v n )- l D Xn P< K 4>) Hl 
= (D^h^T-^e-^Wfn - P <xe J, (A- 1 V) 2 m 1 (A- 1 V;^)" 1 ^<A^) i 2 
by Lemma |J and (|2.19|) . Thus, by (|4.2l[) , 

P< A » and (A- 1 V) 2 m 1 (A,; 1 V;^ n )- 1 P< Afi 0^ (^oo)- 1 ^ in L 2 X , 

we deduce (/ n — <j>n, <f>n) h 1 ^ an d so follows. 

Finally, we turn to the proof of (4.10) which shows that the free evolution of </> n 
captures (at least as n — > oo) a positive proportion of the evolution of f n . Much 
of the dirty work has been encapsulated for us in Lemma 2.9, as we will see. We 
present the details in the case A n < 1 as adapting the argument to A„ — > oo just 
involves minor modifications to the formulae that follow. 
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Using ( 2.19 ) and making the change of variables t = A^s, x = X n y, to account 



for the fact that D x is not an isometry on L*, we have 



K lt<v> (/„-<MI 



= iip^*< v > 



{L -i e -iu(v) T -i fn _ DK(j)) \ 



, (4.22) 



which throws us into the path of Lemma 2J with g n = D^ 1 L~^e~ ltn ^ v ^T^f n and 
g = 4>. This lemma allows us to apply Lemma 2.1C, or more precisely ( 2.29Q , and 
so obtain 

limsup||e-< s ^ lv >P A - n 1 L- 1 e -^.W rK -i /n _ ^ 

n—^oo s '"' 



L' 1 



-«A^ s<A- 1 V> 



i4 



Reversing the computations in (4.22) this becomes 



limsup||e- lt<v) (/„-0„) 



< limsup e 



-*t(V) 



/n 



-iA^ s<A- 1 V) 



i4 

I if . 



This is very close to implying ( 4.1 0| ) ; in view of ( |4.4| ), all that is missing is a lower 
bound of the form 



-iA^ s<A~ 1 V) 



> 



30 1 



(4.23) 



We will not bother to determine the value of C appearing in ( 4.23 ). In accordance 
with this, we will use the symbol C to de note a variety of powers which vary from 
place to place as we develop the proof of ( 4.23 ). 

From ( LIS ) we have 



where /loo is the L^-limit of the functions h n defined in ( 4.16 ) and R is the limiting 
rotation defined above; recall 4> = Rcj). Note that hoc inherits the estim ates ( 2.20 ) 
enjoyed by h n - Together with ||0||z,2 < A (which follows from (L4) and ( 4.11 )) and 
\ v n\ ^5 {A/e) c , these estimates guarantee the existence of radii M,r ~ (A/e) c so 
that 



> 



where h = P<m X-^ _1 ^oo an d y is a smooth cutoff to {|x| < r}. 
To complete the proof of ( f4.23| ) , we merely need to show 



, c 



-iA^ s(A- 1 V) 



h II 



4/3 



uniformly in Aoo and s G [—1,1]. First we note that \\h\\ 



< 



(A/e) c , by construc- 



tion. To extend this to non-zero values of s we use the Mikhlin multiplier theorem. 
The requisite input is 

sX 2 (\- 1 ^)\< a l when \a\ > 1 

uniformly for s E [— 1, 1] and A > 1. This suffices since h has Fourier support inside 
a bounded region, namely, |£| < (A/e) c . □ 

Corollary 4.10. After passing to a further subsequence in n (and possibly changing 
4> and x n ), w e may assume in addition that the parameters in the conclusion of 
Theorem satisfy the following: If X n does not converge to +oo, then \ n = 1 
and v n = 0. Moreover, irrespective of the behaviour of X n , we may assume that t n 
obeys -p — > ±oo or t n = 0. 
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Proof. Suppose A„ — > Xoo G [|, oo). Then D\ n and D x 1 converge strongly to -Daoo 
and D^ 1 , respectively, as operators both on L 2 (R 2 ) and ff,J(R 2 ). Thus we may 



replace <f> by Dx^cj) an d set A„ = 1, whilst retaining the conclusions of Theorem 4.9 
In the case of ( pC[ ), we invoke the Strichartz inequality. By the same reasoning 



we may replace 4> with L„<f) and set v n = 0. 

We turn now to the discussion of t n . By passing to a subsequence, we may 
assume — > too <= [—00,00]; we just need to treat the case toe £ R. Invoking 



( |2.15| ), we have 

T p *MV)t _ T t »<^)- 1 t„<v) 

If A„ = 1 and v n = 0, our assumption guarantees t n — > too, and hence e**"( v ^c/> — > 
e ittxj<v>0 m 7ji(R 2 ^ j n -j- mg casej we replace <f> by e lt °°( v ^c/> and argue as above. 

If A ra — > 00, we pass to a subsequence so that e 1 ^"^ tn — > e lto ° for some too € 



[0,2tt). Thus by (|2.24D 

e 



As D\ n P <x e i is a bounded operator from L 2 (R 2 ) to if^(R 2 ), we may replace 4> by 
gitcxjg-itooA/2^^ set ^ = 0, and change x n to x„ — j^T)t n . We can then argue as 
above. □ 

5. Linear profile decomposition 



From Theorem 4.9, we may deduce the existence of a linear profile decomposition. 



We continue to work with the first order Klein-Gordon equation ([2^) . 

Theorem 5.1 (Linear profile decomposition). Let {v n } be a bounded sequence of 
H^.(M. 2 ) functions. Then, after passing to a subsequence, there exists Jq € [l,oo] 
and for each integer 1 < j < Jo there also exist 

• a function ^ (fP £ £ 2 (R 2 ). 

• a sequence {X J n } C [1, 00) such that either X J n — >■ 00 or \ J n = 1. 

• a sequence v 3 n — > v ] € R 2 which is identically if \ 3 n = l, 

• a sequence {(t 3 nl x J n )} cMxl 2 such either t J n /(X 3 n ) 2 -> ±00 or t 3 n = 0. 
Let P^ denote the projections defined by 

= l>'ei^(R 2 ), if Xi = i 

with 9 = y^Q. Then for all 1 < J < Jo, we have a decomposition 

v n Yr <■<■ * L l)^l>f,,y • ,ri. (5.1) 



Pic 



3=1 



satisfying 



Jim limsup||e lt<v) w^\\ L * jn xm = 0, (5.2) 



J-»oo - - — * " ' r " *•* 



lip (ikll^ -Ell^e^WL^^P^II^ - II^H^ j =0, (5.3) 
k i=i ' 

D-j-lrlT-^e-^^w^ 0, wea/t/y m L^(M 2 ) /or any j < J. (5.4) 
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Finally, we have the following orthogonality condition: for any j ^= j' , 



Inn {^f + ^f 



\3 \ V 3 - v 3 



W 1 



ML 

An 



= OO, 



(5.5) 



where (-s„ J , ) := L j, {t{ - P n ,x( - x{). 



Proof. Given a sequence v n as above, by passing to a subsequence we may assume 
that for some Aq,Eq > 0, 



lim 



\ v n\\Hl(R 2 ) 



= An and lim 



\e~ U{V) v n \\ Li 



5xR 2 ) — £ 0- 



(Note that the Strichartz inequality guarantees En < An.) If En = 0, then we set 
Jo = I and the claim follows; note that in this case there are no (fP 's. Otherwise, we 
apply Theorem 4.9 as strengthened by Corollary 4.10 to find jV n }, {A n }, {(t*,a;*)}, 
and (j) 1 . We set 



Note that by ( J4.ll 



D^LjT-^e-^ < v > w l n - weakly in L 2 X (M 2 ) , 



which gives (5.4) when J = 1. Moreover, by (4. 



lim {|K||^ (R2) - ||T^e<< v >L^/J A iP„V||ff 1( 



2 ) 



} = 0, (5.6) 



which is (5J3) when J = 1. 

By passing to a further subsequence if necessary, we may now assume that 



um ll^nllffi 



A\ and 



lim || e 

n— >oo 



-it<V>, 



lif x (RxR 2 ) — £ 1 



for some Ai,e\ > 0. Furthermore, by fl5.6| ) we have A\ < An, while by (4.10), 

si<e [l-c(%ff 4 . 



If E\ = 0, then we set Jo = 2 and stop. Otherwise, we apply Theorem 4.E to u> n to 
obtain parameters i/ 2 , A„, (i„,a:„), and a function <f) 2 . We then set 



2 ._ 



Arguing as above, we obtain (5.4) when j = J = 2, and also 



lim 



\\T x ^KiD xl P^n 2 m 



"nlll^R 2 )} 



= 0. (5.7) 



Adding (5J3) and (5/7), we obtain (5J3) when J = 2. 

Continuing in this fashion, one of two things occurs. Either we reach some finite 
j so that Sj = 0, in which case we set Jo = j + 1, or we obtain an infinite number 
of sequences of parameters v 3 n , X 3 n , (t J n ,x J n ), and an infinite collection of functions 
(fp , in which case we set Jo = oo . 

At this point, tracing back through the definition of the w J n , we have a decompo- 
sition of the form (O. That (|J) holds is a tautology if the algorithm terminates 
and follows from (4.10) otherwise. The claim ([T|) may be established inductively, 
arguing as for the case J — 2 above. As above, the weak limit in (5.4) is zero when 
j = J, but to conclude fl5.4|) in the case 1 < j < J, as well as the orthogonality 



condition (5.5), we will have to do a little more work. In particular, we will make 
use of the following: 
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Lemma 5.2 (Orthogonality). For j ^ j' we define a sequence of operators 

A„ X J n X n h>n X n 



(5.8) 



// the orthogonality condition (5.5) holds, then A™ converges to zero in the weak 
operator topology on ,6(L 2 (R 2 )). Conversely, if the orthogonality condition fails, 
then after passing to a subsequence, both A-tf and its adjoint converge to injective 
operators in the strong operator topology on S(L 2 (]R 2 )). 

The deduction of the r ema ining conclusions in the the orem from this lemma is 
straightforward. Indeed, ( |5.4|) for 1 < j < J follows from ( |5.5|) (which we have yet 
to prove) and Le mma |j.2| . 

To obtain (5J3), we argue by contradiction and use the crucial fact that, if defi ned , 
<jp 7^ 0. As an example, consider j = 2 and choose the minimal j' for which ( |5.5|) 
fails. For the sak e of this example, suppose j' = 4. As the orthogonality condition 
fails, Lemma 5.2 guarantees that 

aiy -> i> 1 

strongly in L 2 . On the other hand, as (j) 4 = wcak-limZ? A 4 1 L^ 4 1 T a ^ 1 e _lt «^ v ^wf i and 
the adjoint of A 24 converges strongly, 



ip = weak-limA 



24 



D Xi h J T xi e 



(w 2 n -T x 3e lt ^L^D X 3 



= 0. 



wh ich c ontradicts tp ^ 0. Note here that we used the previously pr oved j = J case 
of (5.4) to treat the first term and the minimality of j' and Lemma 5^ to treat the 
second term. □ 

We move now to the proof of the lemma. 



Proof of Lem ma \5.q . Let us first note that the adjoint of A J J is A J n J \ this follows 
from Lemma 2A and the fact that tuq appearing there commutes with translations 
and free evolutions. Thus, strong convergence of the adjoint will follow by reversing 
j and j'. 

We begin by rewriting A™ in a more convenient form; more precisely, using 
( 2.15| ) we have 



. 7 L j L j'T e 

Xl. Vli 



Writing L ?L ,/ 
mute, we obtain 



= L^ 3 j/ and using the fact that rotations and dilations com- 

Ajf =Rti' D-lh^T^e-^' ^D xt . 

-> \v 3 n — vl\, with the implicit constant depending on the upper 
m£ |. As rotations form a compact group of unitary operators, we 



Note that \v^> 
bound for \v^\ 
may neglect Rtf in what follows. 

On the Fourier side, a careful computation using (2.14) yields that for a function 
/eL 2 (R 2 ), 



Al 3 /(0 



xl <e/Ai; 
which we rewrite as 



Ai j 'f(0 = H!;> ('<■> E» F» G% m 
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with 



ujf .— "Ji , " c jj ' •= D ■> E jj ' ■= (£ X " , 



pjj' .— e - is " ii/K > (pi' ■— e -Wn i/K 

Here, we used the notation 

4(0-^ + <^"-<0^ and ^(0 - A4(A" X = ^ + MC 11 - <!>A^, 

while * is used to denote the pullback, that is, (£„)* f := f o £„. 

To continue, by passing to a subsequence, we may assume the following: 

(i) Either + oo or % — ► A<x, G (0, oo). 

(ii) Either A^ | — v 3 n | — > oo or there exists a diffcomorphism , whose Ja- 

cobian is bounded both above and below, such that £ "., — > ^oo, uniformly 

on compact subsets of R 2 . 
(hi) Either |s^'|/(A^') 2 -> oo or s#'/(A£') 2 ^ Soo el. 
(iv) Either l^'l/A^' -> oo or y^'/A^' 4 fc el. 

We start by addressing the second half of the lemma. Assume therefore that 



(5.5) fails. In this case, it is easy to check that each of B-tf through G 3 J converges 
strongly to an injective operator and hence so does their product. 

It remains therefore to consider the case when (5.5) holds. Directly from the 
definition, we see that A™ forms a uniformly bounded sequence of operators on 
L 2 (R 2 ); thus it suffices to show that 

lim (A% <j>,ip) L l(B!>) = 

for every pair of Schwartz functions <fi and ip with compact Fourier support. 
The sequence of functions 

c (y(W)) 

t H-)- ; 

is uniformly bounded and converges uniformly on compact sets by virtue of our 
assumption that the sequences v 3 n , v £ converge in R 2 and that the sequence A^ 
converges in [l,oo]. Consequently, B-tf and its adjoint converge in the strong 
operator topology on Z?(L 2 (R 2 )), and we may disregard B£> in what follows. 
As E 3 J through G^ are isometries on L|°, 

\{AiU,i>) Li (u*)\ <^ii^iu r iiv;iui; 

An 

indeed, the first factor on the right is the norm of Ci™ in L|°. This proves weak 
convergence to zero in the case when X J n /X n —> 0. The case when X n /X J n — > oo can 
be handled by reversing the roles of j and j' and recalling that the adjoint of A 3 J 
is A n 3 . This leaves the case when the ratio converges to a finite positive number; in 
this scenario, C-j 3 and its adjoint both converge strongly and so may be neglected 
in what follows. 

Looking back at the definition of £*, we see that if X J n \i/g | — > oo, then 

Eg' Ft? Gg 4> and ^ 
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have disjoint supports for large n, which proves weak convergence to zero. If on 
the contrary X° n v-tf converges then, by observation (ii) above, E-tf and its adjoint 
converge in the strong operator topology on £>(L^(M 2 )), and we may disregard it 
in what follows. 

It remains only to show that 



n^oo 



lim / ^m e M-K f ^/K)-^yl j '/K}^ = o (5.9) 



whenever (X 3 n )~ 2 \s™ | — >• oo or \/V n — > oo. In the former case, this follows from 
the van der Corput lemma after noting that (£/A) > A - 2 (£/A) — 1 . If only the 
latter sequence of parameters diverge, then (5.9) reduces to the Riemann-Lebesgue 
lemma. 

This completes the proof of Lemma 5^ . □ 

We end this section with a few propositions that will be useful when we apply 
the linear profile decomposition in Section |t] to extract a minimal blowup solution. 

Proposition 5.3 (Energy decoupling). Suppose {v n } is a bounded sequence of H x 
func tions. Then after passing to a subsequence, the linear profile decomposition 
(5.1) satisfies the following: for each J < Jq, 

J 

lim {E(v n ) - ]T E(T xi e u ^L ui D xi P^) - = 0. (5.10) 

Proof. We will prove that the energy decouples in the inverse Strichartz theorem, 
that is, in the case J = 1; the general case follows by induction. Furthermore, by 
( |5~3| ) , it suffices to show that 

]im{||Rei;„||i< - || Re0„||t 4 - || Rew n f Li } = 0, (5.11) 

where 

</>n = T Xn e it ^L Vn D Xn P n <t>, 

with P n — 1 if A„ = 1 and P n = P<\e if A„ — > oo. 

We start by considering the case when A„ = 1 ; recall that in this case we in fact 
have 4> € ^i, 

(because v n = § and P n is the identity), and either t n — ¥ ±oo or t n = 0. Approxi- 
mating <f> in H x by Schwartz functions and applying the dispersive estimate, we see 
that 

\\e itn{v ^(f)\\ L 4 when t n -J- ±oo. 

Claim ( 5.11 ) now follows easily. Next we consider the case A„ = 1 and t n = 
0. By (5.4), we have T^ Xn w n — Y 0, weakly in H x . T hus, by Rellich's theorem, 
a subse quen ce of T_ Xn w n converges a.e. to 0, and so ( 5.11 ) follows by applying 
Lemma 2.10] with F n = ReT_ Xn v n and F = Re<j>. 

It remains to consider the case when A„ — > oo , which we treat with the following 
lemma. 

Lemma 5.4. If X n oo, then 

lim \\<f> n \\ Li -0. (5.12) 



n— >oo 
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Proof. We will use Bernstein's inequality, which implies that 

. . 1/2 

ll^nlU* ^ [diam(supp </>„)] ||</>n|Uj- 



(5.13) 



We note that since |%4 n (£)l ^ (fn) for £ e K 2 , by Q2.14Q we have 
diam(supp</>„) = diam(supp([L,, n .DA, l -Pn</>H) 

< (v n ) di a m(supp([ J DA n P n ^D) < K)A£ _1 . (5.14) 
Furthermore, by Lemma [2.4| , we have 

H^IUj - \\L Un D Xn P n <P\\ Li < (v n )\\4>hl- (5-15) 
Using ( 5.14 ) and ( |5.15 ) to bound the right side of ( 5.13 ), and then using bound- 



edness of the v n , we obtain 



and the lemma is proved. 



□ 



By Lemma 5.4, we have \\v n — w n ||^4 — ► 0, and (5.11) follows. This completes 
the proof of the proposition. □ 

Proposition 5.5 (Decoupling of nonlinear profiles). Let ip 3 and tp 3 ' be functions 
in C^°(R xl 2 ). Given parameters u° n , v 3 n , (P n ,x 3 n ), (P n , x J n ), X J n , A^ as above, we 
define i\) 3 n by 



[r n (- + t{, ■ + x{) o L-}\ (t, x) := ^ (j^, ^) , 



(5.16) 



and similarly for ip 3 n . Then under the orthogonality condition (5.5), we have 

lim \\ip J n ip J n || L 2 (RxR2) = 0. (5.17) 

Proof. Let 

L*(t,x) := {{v)t - ^■x,x ± + (v)x^ - Xist) 

and let X 33 = X 3 n /X 3 n . Additionally, let R 33 be spatial rotations and v 33 be boost 
parameters such that 

L v , : oL- 1 (t,x) = L vij ,(t,Rii'x) 



(cf. the proof of Lemma 5.2). Recall that \vff | ~ — v 3 n \. 

With this notation (using the fact that spatial dilations and rotations commute) , 
we compute 



xR 2 



< 3 \(K J ) 2 K J J \ 



(K ) 2 



o L% (t, x) 



(5.18) 



dx dt. 



As before, by passing to a subsequence, we may assume that R 33 converges. By 
absorbing the limit into ip 3 and using continuity, it suffices to treat the case when 
R 33 is the identity. 

If X 33 — > 0, then by Holder's inequality and ( 5.15 ), we have 

Wj,tfdxdt < {x 33 ' ) 2 ||^||1j H^llLo. o. 
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Similarly, if A^ 7 — > oo, we have 



\^:\ 2 dxdt<(x^'r 2 \\^\\i r w'\\ 



0; 



here we have used the fact that is volume-preserving. We may thus assume that 
A^ converges to some positive number, and arguing as above, it suffices to treat 
the case when Xtf = 1. 
Let 

S»' = supp(^) n su PP (^' (• - • - o /.; .). 

If (t,x) £ Sff , then \t\ + \x\ < 1 because supp^ is compact, and 



< l 



beca use s upp^ J is compact. Thus if |A^z^ J | — > oo, then by Holder's inequality 
and ( 5.18 ) we have that 



Finally, if A^ remains bounded while 
ally empty. This completes the proof of the proposition. 



oo, then S-tf is eventu- 
□ 



6. Isolating NLS inside nonlinear Klein-Gordon 



In this section, we consider the mass-critical nonlinear Schrodinger equation in 
the form 

(6.1) 



(id t + ±A)w = /zflHV 



with fi = ±1 as in (1.1) and (2.2). This normalization of the nonlinear Schrodinger 
equation appears naturally in connection to NLKG and can be reduced to (1.6) by 
rescaling w and x. Correspondingly, the ground state solution associated to (5.1) 

is WQ(t,x) := e~ lt ^/|Q(-\/2 x) where Q is as in @. Note that M(w Q ) = §M(Q). 
Under this rescaling, Conjecture 1.3 takes the following form: 

Conjecture 6.1 (Global well-posedness of NLS). Fix a value of /i = ±1. Let 

u>o £ L 2 (M. 2 ) and in the focusing case assume that M(wq) < |M(Q). Then there 
exists a unique global solution w to (5.1) with w(0) — wo- Furthermore, this solution 
satisfies 

IMk? iX (RxR=) < C(M(w )), 

for some continuous function C . As a consequence, the solution w scatters both 
forward and backward in time, that is, there exist w± £ L 2 such that 



\w(t) 



it A/2 | 

e 1 w ± \\ L 2 



— > as t —¥ ±oo. 



Conversely, for each w± there is a global solution w to (6.1) so that the above holds. 



The goal of this section is to prove the following theorem: 
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Theorem 6.2. Assume that Conjecture 6.1 holds. Let sequences v n — > v £ M 2 , 
X n — > oo, and {t n } G R be given. Assume that either t n = or t n /X„ — > ±oo. 
Let 4> £ L 2 (R 2 ), and in the focusing case assume also that M(<fi) < |Af(Q). If we 
define 

K :=T Xn e u "WK n D Xn P< xs J 

for 9 = y^j, then for each n sufficiently large, there exists a global solution v n to 
(2.2) with initial data v n (0) = <p n , which satisfies 

Sm{v n ) <M(0) 1- 

Furthermore, for every e > 0, there exist N e and a function tp s £ C C °°(R x R 2 ) such 
that for all n > N £ , 

\\Re{v n o L~i(t + t n ,x + x n ) - ^^ e {jt, ^)}|| i 4 3;(RxE 2 ) < £ (6-2) 

where (i n , x n ) := L Vn (t n , x n ) is the center of the wave packet in boosted coordinates. 



Remark 6.3. As we will see in due course, the proof of Theorem 1.4 in the focusing 
case relies only on the consideration of those <b with M(<f>) < M(Q). Thus, the full 



Conjecture |L3| (or equivalently, Conjecture |6J) covers more cases than are needed 
here. In a similar vein, a proof of Conjecture 1.3 in the focusing case up to some 
intermediate mass threshold M* < M(Q) yields a corresponding result for NLKG. 

We begin with the proof in the case v n = 0, which adapts the ideas in [gl|, §4]. 
This result is then used to treat the general case. 

Proof in the case v n = 0. As the first order nonlinear Klein-Gordon equation is 
invariant under spatial translations, we may assume x n = 0. Thus, (6.2) will follow 
from 

* x Ml - (6.3) 



\V n {t + t n , x) — S j^-tfj E (jT, X^)\\ L 4 



< e. 



We begin by defining solutions to (6.1); we will later modify these to produce 
approximate solutions to (2.2). 

In the case when t n = 0, we let w n be the solution to (|6.lD with initial data 



w n (0) = P<\i<t>- 
Similarly, we let Woo be the solution to ( |6.l| ) with initial data 

Woo(0) = (p. 

In the case when t n / \ 2 , — > — oo (respectively t n j\ 2 n — > +oo), we denote by 
w n the solutions to (3.1) that scatter forward (respectively backward) in time to 
e itA/2p < ^ e Correspondingly, we define Woo to be the solution to ( |6.l| ) that scat- 
ters forward (respectively backward) in time to e ltA ^ 2 cj). (The signs here are correct 
because if the bulk of the solution is living around time t n —¥ — oo, then time is 
well inside the forward scatte ring regime.) 

As we assume Conject ure |6.l| holds (and that M{4>) < $M(Q) in the focusing 



case), all the solutions to (|6.1[ ) defined above are global and moreover, 

Sr(w„) + Sr^Woo) <m{4>) !• 

We begin with a few basic observations about the sequence w n , which will be 
helpful in what follows. 
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(6.4) 
(6.5) 



Lemma 6.4. For s > ; the solutions w n defined above satisfy 
and 

\\(V) s d t w n \\ LU < MW X^ e . 
Furthermore, we have the approximation 

lim \ ||w„-w 00 || L? o i 2 +\\w n -w 00 \\ L 4 +\\Dx n (w n -P <x ew 00 )\\ i \ = 0. (6.6) 

Proof. The first inequality follows from persistence of regularity arguments (cf. [ fl3[ 
Lemma 3.10]) and the fact that by Bernstein's inequality, 

|||V| s F< A «0|| L , (R2) < \ s n 9 U\\ LUm - 



To prove inequality (6.5), we use the equation (S.l) together with (6.4), Holder's 
inequality, and Sobolev embedding: 

iKvr^ii^ 

< IKV^AWrcH^ J RxR 2) + ||(V) ; '-U;„|| i 4 L 12( R><R 2)||w„||^= oi 12( R)<R 2) 

<m W Ai 2+S)e + IKV) 1 / 3 ^^^^^,^)!!^! 5 / 6 ^!!^^^,^) 

That the first two terms in ( |6.6| ) tend to zero is a consequence of the stability 
theory for the mass-critical NLS; this result may be found in J23| or p^ |. 

We turn now to the final term on the left side of d6.6| ); changing variables and 
using the triangle inequality and ( |6.4[ ), we obtain 

\\D\ n {Wn ~ P <\i W oo)\\ L?aH i/2 

= IKA-'v^K - P<A^oo)IU ri 2 

< ||(A j ; 1 V) 1 / 2 P> A „ W „|| iri 2 + \\{X n 1 V) 1 ^ 2 P<\ n (w n — w oc )|| L oo L 2 

+ ||(A- 1 V) 1 / 2 P A ,<.< Aii i i ; 00 || iri 2 

< \T 1/2 |ll V | 1/2w n|lL-L2 + \\Wn ~ WocjU-L 2 + 1 1 P> A» «'=o 1 1 L t °° L 2 



< 



M(0) 



^1/2+8/2 + _ Woq \\ 2 + ||p g Woo \\ LrL2 



It is immediate that the first term on the right-hand side above converges to zero 
as n — ¥ oo , while the convergence to zero of the second term follows from ( 3T ) . It 
remains to consider the third term. 

By our assumption that Conjecture 6.1 holds, Woo scatters both forward and 
backward in time; let w± £ L\ be the scattering states. Then 

|P>A6U' 00 ||l~L2([T,oo)xR 2 ) + ||P>A« w oo||l~L 2 ((-oo,-T]xR 2 ) 

< ||Woo - e lt ^ l2 W+^ L f Ll{{T,oo)y.9?) + \\Woo - e ltA/2 W- || L oa L 2 _ T] xR2) 

+ \\ P >\ e n W +\\Ll(M?) + II^A^W-Hl^K 2 ), 

which can be made arbitrarily small by choosing T and n sufficiently large. On the 
other hand, for each fixed T > 0, the continuity of the mapping 1 1— > u>oo(i) together 
with the compactness of [— T, T] and the fact that the sequence of operators P> A e 
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is uniformly bounded and converges to zero in the strong operator topology on 
Ll(R*) yield 

lim H-P>A« W°o| | Lf Lin-T,T]KM?) =0. 
n— >oo — n i 

This completes the proof of the lemma. □ 



We now use the solutions w n to NLS to construct approximate solutions to (2.2). 
Let T be a large, positive real number, to be determined later. We define 

f e- u D Xn w n {t/Xl), if \t\ < TXl, 

v n (t) := I e-^*-^)^)^^), if t > TXl (6-7) 
[e-^+ T ^Wv n (-TXl), if t < -TXl 

with the idea that v n (t) — v n (t — t n ) will be small. Ultimately however, our approx- 
imate solution will be a further modification of this. In particular, we will make 
an adjustment on the middle interval which is small in H^ 2 {R. x M 2 ). This will 
result in an analogous change on the outer intervals, but as we will show (using the 
Strichartz inequality), the modification is negligible in this regime. 

Forgetting the above-mentioned technical issues for now, we give an explanation 



as to why v n might be an approximate solution to (2.2). On the middle interval, we 
can use the estimate ( [2.24 ) to show that the above transformation takes solutions 



to the linear Schrddinger equation to approximate solutions of the first-order linear 
Klein-Gordon equation. The behavior of the nonlinearities on this interval is a 



bit more mysterious, but the specific factor I appearing in (3.1) will ensure that 



certain resonant error terms cancel, while the non-resonant error terms will be 
subdued via the use of X s ' b -type estimates. As t tends to infinity, the differences in 
the two dispersion relations become amplified and the approximation breaks down. 
Fortunately, by this time the NLS solution is well dispersed and so resembles a 
free evolution. This behaviour is inherited by the Klein-Gordon evolution as we 



intimated in (6.7); see also Lemma 3.S. 



Wc now set about supplying the details behind these heuristics. As each heuris- 
tic i ntro duces some small error, we will need to use the stability theory (Proposi- 



tion ^4 ) to construct the final solution v n . This eventuality dictates which estimates 



we need to prove, beginning with the following: By the Strichartz inequality and 



Lemma 6.4 



>) ^ \\Dx n w n \\ LTHl ,, + \\D Xu w n (t/Xl)\\ 
^I + A^IIIVI 1 ^ 



i-f^n lll v l W n \\L?Ll 

<M W 1 + X- 1 '^ 9 ' 2 < MW 1. (6.8) 
Lemma 6.5 (Matching initial data). If v n is given by ( |6.7|) , then 

lim limsup \\v n (-t n ) - 0„|| i? i/2 fR2 , = 0. (6.9) 

Proof. In the case when t n = (recall that <f> n = e ttn ^ D\ n P <x e <f>, as we assume 
x n = v n = 0) we have v n {— t n ) = <p n , so there is nothing more to prove. 

Consider now the case when t n j}? n — > — oo; the case t n IX 2 n — > oo can be treated 
similarly. In this case, for any finite choice of T, we eventually have that —t n > A^T. 
Thus, we can rewrite ( |6.9| ) as 

lim limsup \\e-* T ^D Xn w n (T) - e - jA " T < v >£ A „P< A ,0|| ffl/2 = (6.10) 
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By the triangle inequality, 

\e- iT ^D Xn w n (T) - e-^ T ^D Xn P< xe J\\ Hl 

< ||-Da„ [w n (T) - P< A 8 Woo(T)] || ff i/2 (R2) 

+ ||P A „P< Ae [ Woo (T) - e <r(i-<A-v»^ 



< ||D Ab K(T) - P< A ^oo(T)] || ff i/2 (R2) + IkocCO - e' iA " T(1 - <A " lv>) 0lU2 (R2) 

< ||£> A „ [™„(T) - P< A ^oc(T)] || ffl/2 + ||«7oo(T) - e tTA/2 0||L2 (R2) 



[1 



e 



iTA=(l-(A- 1 £)+|A- 2 |€| 2 )l 



Now ( |6 .10 ) follows from (|6.6|), the definition of (in the case t n j}? n — > — oo), and 
the dominated convergence theorem, respectively. □ 



Next, we show that are approximate solutions to (2.2), starting with the large 
time intervals. More precisely, we will prove 

Proposition 6.6 (Large time intervals). With the notation above, 

lim limsuplle-^-^^S^T^JIIij (( ^ Tfao)xRa) =0 

and analogously on the time interval (— oo, — A„T). 

This proposition is an immediate consequence of the two lemmas that follow; 
indeed, one merely needs to combine them with the triangle inequality. The first 
lemma shows that for a large enough time, we can safely approximate the nonlinear 
solutions w n by solutions to the free Schrodinger equation. 

Lemma 6.7. Let w+ be the forward-in-time scattering state of the NLS solution 
Woo defined above. Then 

lim hmsu P || e -*-^)<v> [v n (TXl)-e-^ Id x ^ a ' 2 P^ W+ ] || ((A2Too)xR2) = 0. 

A similar statement holds on the time interval (— oo, — A^T), but with w- in place 
ofw+. 

Proof. We will give the argument for the forward-in-time statement. By the Strichartz 
inequality for the first-order Klein-Gordon propagator, it suffices to prove 

lim limsup \\v n (TX 2 n ) - e- lTX ~D Xn e tTA / 2 P< x ew+\\ Hl/ 2 = 0. 

By Lemma 3.4, it suffices to show that 

lim limsup||£> A?i P< A e [w^T) - e lTA/2 w + ] \\ H i/2 (R2) = 0. 

To see this, we note that 

||#a„P<a» Ko(T) - e lTA / 2 w+] \\ H m m < \\ Woo (T) - e lTA / 2 w + \\ Ll(M2) , 

which converges to zero as T — > oo. □ 

From the previous lemma, we know that £ ra (TA^) is well approximated by the 
free Schrodinger evolution of a specific function w+. The second step in the proof 
of Proposition 3.6 is to show that for T large, the free first-order Klein-Gordon 
evolution of this function (into the future) is small. Colloquially, a solution that is 
well-dispersed for Schrodinger is also well-dispersed for Klein-Gordon. 
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Lemma 6.8. Let ip E L X (M. 2 ) and let A„ — > oo be a sequence of positive numbers. 
Then 

lim limsup||e- i ( t - A » T )< v >e- iTA »£> A „e iTA / 2 P< A .V'llLf I ((A=iT,cx ) )xR=') - 0. (6.11) 



Proof. By the Strichartz inequality (2.5) together with the easy inequality 
\\{V) 1/2 D Xn P< K {^ - ^)\\ Lim < ||V-^||^(R»), 

we may assume that "0 is a Schwartz function with compact frequency support. 
Consequently, for n sufficiently large, supp?/> C {|£| < X 6 n } and we can therefore 
ignore the projection operator P<\e in what follows. 
Next, we set 

f n (t) := e -«<^ 1 V> e ^[(A- 1 V)-l+A/(2A^)] V) 

and observe that with this notation, ( |6.11 ) becomes 

lim limsupA^ 1/2 ||/ n || L 4 ((A 2 T )xR 2) = 0. 

The proof will be via a stationary phase argument. We write 
where 

KAO ~-%{x-Z- t{X-H) + T\l [{X~H) 1 - A- 2 |^| 2 /2] }• 
A computation yields 



fn(t,x) 



1 - 



a;t 



where 5a denotes the Kronecker delta. Thus, 



d i d j h n , x (0 = s ij + o(x-^- 2 ^), 



(6.12) 



uniformly for |£| < A 29 and t > A 2 T. IiVh n , x does not vanish in {|£| < A 26 }, then 
by ( 6.12| ), |V/i n , x | > 1 uniformly on supp^ C {|£| < X e n }. If Vh n ^ x vanishes in 
< then by the Morse lemma (cf. ||, p. 346]) and ( |6.12| ), there exists rj n>x 

which is a diffeomorphism (uniformly in n, x, £) on suppV> and such that h n>x (£) = 
\Vn,x{0\ 2 + c n,x- In either case, by the principle of stationary phase, we have for 
t > A 2 T and x E K 2 that 

A 2 

\fn{t,x)\ <v, -f . 

By interpolation with the trivial L 2 bound, we obtain 

ll/n(*)|Uj(R2) <</> (V) 1/2 - 
Integrating this with respect to time we get 

II/"IIl* x ((A2T,oo)xR 2 ) i$V X]J 2 T 1 / 4 , 
which completes the proof the lemma. □ 



44 



ROWAN KILLIP, BETSY STOVALL, AND MONICA VISAN 



We now turn to showing that v n is an approximate solution to (2.2) on the 
middle time interval. A computation shows that on this middle time interval, v n 
satisfies the following approximate first-order Klein-Gordon equation: 

{-id t + (V))u„ + //(V) _1 (Re v n ) 3 = e x + e 2 + e 3 + e 4 , 



(6.13) 



where 



ei '■= G H D\ n | [(A^ 1 V) - 1 
e 2 ^[(V^-l] (Rev n ) 3 
e a :=iRee^'k„ W „(^) 



2A- 



e 4 



3 e lt |i? A „ Wn ( 1 4 r )| z ^ A „ W „(^). 



The error terms ei and e 2 can be estimated in spaces for which Proposition 3.4 



applies. We will estimate e\ in L\H^ 2 . As 



(A n - 1 - 2A^ 



< 



2 A* ' 



by Holder's inequality and (pM, we obtain 



ei 



LlH¥ 2 ([-A 2 T,A 2 T]xR 2 ) 



< rA n 2 (ll A2 W„||L r L2 (R><R 2 ) + A„ 1/2 ||(V) 9/2 W„|| LrL 2 (RxR 2 ) 



< T\- 2+4e as 7i oo. 
Next we estimate e 2 in L^ 3 Wx' 4 ^ 3 . Noting that 
(V)[(V)- 1 -1]=V- 



(6.14) 



1 + (V> 

and that by the Mikhlin multiplier theorem the second factor on the right is bounded 
on L 4 ' 3 , we have 



l(V)e 2 | 



< 



. ([-^T,A2T]xR 2 ) 
r, \3| 



||V(Ret>„) 



L^([-\lT,\lT]xR 2 ) 
~ H £>A "^" m "(^")llL* x [-A2T,A2T]x 



|£>A„^n(p") 



SxR 2 ) 



< a: 



as n 



oo. 



(6.15) 



In the last inequality we used ( p.4[ ). 

Unfortunately, the error terms and e 4 are not small in either of the spaces 
L\H^ 2 or L^ 3 Wx' 4 ^ 3 for which Proposition 3.4 applies. However, they oscillate 
in spacetime like e ±3lt and e lt , respectively, and the frequencies ±(3,0) and (1,0) 
are far from the surface {(— (£},£)}• This anows us to use A s ' b -type arguments in 
the manner of ||. 

Lemma 6.9. For j = 3,4 let / nj solve the equation 



(-id t + (V»/„ 



/n,i(0) = 0. 



Then 



\fn,i\ 



L£°ffi /2 ([-A 2 T,A 2 T]> 



) + ll/njIlL^CI-AjT.A^TJxR 2 ) 



< a; 



-2+30 
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Proof. We will prove the lemma for j = 4. The argument for j = 3 is almost 
identical. We compute 

■ it 

(-id t + (V»(/„,4 - ±e 4 ) = ^{^- [dt{w n w^)] (\- 2 t, Xn x) (6-16) 



^[((A^V) - l)K^ 2 )](A- 2 t,A- 1 : r)}. 



n: — I .... . 1/2 

Thus by Lemma 2.3 and the triangle inequality, it suffices to bound e 4 in L^°H X 
and Lj x and the right-hand side of the identity above in dual Strichartz spaces. 



Estimating much as in (6.15), we obtain 

H e4 llL t ° c i/y 2 ([-A2T,A2T]xR2) ~ IK V ) 1/2 - D A„Wn||L~L6(KxR2)||£ ) A„U'n||i t °oL6( R><R 2) 

< A- 2+2e . 



Similarly, using Sobolev embedding and (5.4), we have 



H e4 lli'f, a! ([-A^T,A2T]xH a ) ~ \i II^JUf^RxR 2 ) 

< X n 2 || W n ||l 4 L12(RxR2) II W n || l~li 2 (RxR 2 ) 

< A^ a |||V|V3 1Un || i « B(RxRa) |||V| B /Vlll«.£ S( R X R» ) 

<K 2+2e - 

We turn now to estimating the right-hand side of ( |6.16 ). Noting that 

(V)[(V)-1] = -A[1 + (V)- 1 ]- 1 

and that by the Mikhlin multiplier theorem the second factor on the right is bounded 
on L^ x , we estimate 

< v >{ J[«\t 1v > - ijK^K^a-'i)} 

~ K 2 {\\Aw n \\ L 4^( RxR2) \\w n \\ 2 L 4 J RxR 2) - 



L V,x ( h T > T l * R2 ) 



|Vw n || 2 4 



<R J ) ll^n || x (RxR 2 ) j 



< A^ 2 + 29 . 



Note the applicat ion of ( 6.4 



in the last step. 



Finally, using ( |6.4| ) and (3.5), 

(V){^[d t (w n w^ 2 )](\-H,\- l x)} 

< \- 2 \\(V)d t (w n w^ 2 )\ 

^ ^{iKv)^^!!^^^!!^!! 2 ! 



IxR 2 ) 



+ ll^t w n|lLf :c (RxR 2 )ll( V ) w n|lL* !c (RxR 2 )ll w n|lL* :1; (RxR 2 )} 

< A- 2+3e . 



□ 



This completes the proof of the lemma. 

We now make the promised modification in v n and show that the modified se- 
quence approximately solves (p^). With / nj as defined above, we consider the 
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sequence 

f v n (t) - /„, 3 (t) - / M (t), if |t| <TA 2 „, 

«»(*) := ^ e^'-^lWs,^), if t > TA 2 , (6.17) 
[c-*(*+^)<v)= n( _ TA a )) if t< _ TA 3. 

The key facts about v n are that it is a good enough approximate solution to allow 
us to invoke Proposition 3.4 and tha t it is close enough to v n to allow us to deduce 
that the resulting solutions v n obey (6.2). We collect these together in the following 
proposition: 

Proposition 6.10. For each e > there exist T and N so that for each n > N , 
{-idt + (V)K + /i(V)- 1 (Rci)„) 3 = ex + e 2 + e 3 , 

with 



+ II (V)(e 2 +e 3 ) L4/3, 



:xi ! ) 



< e. 



Moreover, 



\v n v n\\ L? o H 



1/2, 



+ \\v n - V- 



n\\L* 



txR 2 ) 



< e. 



(6.18) 



^M(0) I- 



7n particular, by ( J6.8| ) we /iaue ||«n|| i0 o fl -i/2 (RxmJ , + H^Hl^r 
Proof. Let 7 n := [— A 2 T, A 2 T]. On this interval, direct computation reveals 
ei=ei, e 2 = e 2 , and e 3 = A*(V) _1 (Re(6„ - /„ j3 - /„,4)) 3 - (Re t)„) 3 



By (6.14) and (6.15), we have 



l(V)e 2 | 



L* / a f(/„xR 2 ) 



<TA- 2 + 49 + A- 1+9 . 



On the other hand, by the triangle inequality, Holder, ( p.8| ), and Lemma p.9| , 

ll( V )e 3 || i 4/3 (/iiXK2) < [||Un|liJ ias (J„ x R2) + !l/«.3|li4 x(/nXR 2 ) + ||/T.,4|| L 4 a!( J nXR2) ] 

x [ll/n,3||Lf x (/„xR 2 ) + ll/n,4||Lf x (/„xR 2 )] 

These bounds show that for any T one may choose N sufficiently large so that for 

n> N, 

INI^/ 2 (/ nXR 2) + IK V H g 2 + e3)|| i; /« (/BXR1) < |e. 

For the complementary time intervals, ei = e 2 = and e 3 = /Lt(V) _1 (Re{i n ) 3 . 
By Proposition 6.6, Lemma 6.9, and the Strichartz inequality, we have that for T 
and n sufficiently large, 



|(V)e 3 | 



L A t '!(\t\>TXl) 



< 



< 



l^,,(l*l>TA 2 ) 



l£t,,(l*l>TA 2 ) 



In,'. 



\L?°H 



i/ 2 + \\fn,- 



\L?°H 



1/2 



We now turn our attention to (3.1S). The contribution from /„ is controlled 
by Lemma 6.9, while the contribution from the complementary time intervals is 
controlled by combining this lemma with the Strichartz inequality. 

This completes the proof of the proposition. □ 
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We are now rea dy t o complete the p roof o f Theorem |6.2| in the case v n = 0. 
Combining Lemma 6.5 an d P roposition 6.1C , we are in a position to apply the 
stability result Proposition 3.4 with v n as the approximate solution, and so obtain 
(for n sufficiently large) a solution v n to (2.2) with initial data v n (0) = 4> n and 
finite scattering size. Moreover, by d6.18| ), 

lim {\\v n (t) - v n (t - i n )||roo i? i/2 m xR2) + \\v n (t) - v n (t - i„)IU* (RxR 2 )} = 0- 

(6.19) 

Finally, we verify ( |6.3| ) . By the density of C£° in L\ x we may choose ip e so that 

Combining this with ( 6.19| ), we see that it suffices to show 

\\v n - e" It L>A„w 00 (A I 7 2 t)|| i 4 x(R><R 2 ) < \e for n sufficiently large. 
By the definition of v n and the triangle inequality, 

\\v n - e" It £)A„w 00 (A^ 2 i)|| L 4 x(RxR 2 ) 

~ ll^llif x (|i|>TA2) + \\w n - Woo\\ L i |j; ( lxl 2) + ||^oo|U| ia .(|t|>T)- 

Each of these can be made arbitrarily small by first c hoos i ng T large and then n also 
sufficiently large; specifically, we apply Proposition 6J3, (6J;), and the dominated 
convergence theorem, respectively. 

This completes the treatment of Theorem 6.2 in the case v n = 0. □ 

We now turn to the general case, in which our only assumption on v n is that 

v n -)• v e M 2 . 

Proof in the general case. Recall that (t n ,x n ) :— L Vn (t n ,x n ) and hence, by the 
commutation rule (2.15), 

K = T Xn e u ^L Un D Xn P< x eJ = h Vn T £n e il "^ D Xn P< x oJ. (6.20) 

By spatial translation invariance, we may alter x n to whatever value is convenient 
(previously, we set x n = 0). For this part of the proof we choose x n = v n t n j (v n ), 
which has the effect that x n = and t n — t n /(i> n ). 

By our proof in the case v n = 0, for n sufficiently large there is a global solution 
v„ to (2^2) with initial data 



Moreover, it obeys <Sk(v°) Sm(4>) 1 and for each e > 0, there exists if)® € C° 
and AT? such that 



|Re{«°(< + t n , x + x n )- 



< £ 



(6.21) 



3.22) 



whenever n > N9. 



As i>° solves (|2.2|), so it ° :— Reu° solves (1.1) and thus by Lorentz invariance, 



u' n := o L Vn also solves (1.1). Note that it is necessary to pass through real 
solutions here since 

v l n :=(l+i{V)- 1 d t )u 1 n = {l + i{V}- 1 d t )Rev°oL Vn (6.23) 

(cf. (2.1)) will not equal v® t o L Un except in some exceptional circumstances. The 



former solves (|2.2|), while the latter need not. 
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By the manner in which it is constructed, we expect (t) to be a good approx- 
imation to the sought-after v n (t). Both are solutions to (2.2); however, they have 
different initial data because L„ n only faithfully represents the action of Lorentz 
boosts on solutions of the linear Klein-Gordon equation. Nevertheless, we will now 
prove the discrepancy to be small, which will allow us to apply the stability result 



Proposition 3.4. 



Proposition 6.11. For n sufficiently large, v„ is a global strong solution to (2.2). 
Moreover, sup n 5 R (^) <m{4>) 1 a nd 

lim K(O)-0„||m =0. (6.24) 



Proof. By Corollary 3.7, we have that is a strong solution to (1.1). This implies 
that v„ is a strong solution to (p^). As SrIv^) = Sr(v%), the spacetime bound is 
inherited directly from 



We now turn to the initial data. As in the proof of Corollary 3.7, we decompose 



where 



,0,lii 



solves the linear Klein-Gordon equation with initial data 



(i + ^vr^x^co) = Vn (o) = l-v„. 

Then by the action of L„ on linear solutions (cf. (|2.13| ) or (3J3)), we have 

(1 + »<V)- 1 fl e )[<' lln o L u J{0) = L Un v n (0) = cj> n , 

from which we deduce that ||v*(0) — i^nllff 1 = o L Un (0, ■) II 1 • 

By construction, u„(0, •) = 0. Thus, we need only estimate the contribution 
from the nonlinearity in the spacetime region 

il n = {(t,x):Q< (v n )t < -v n ■ x) U {(t,x) : -v n ■ x < (v n )t < 0}. 



To do this, we argue in much the same manner as in the proof of Corollary 3.7 
using in place of the u appearing there. As in ( 3.12 ), 



limsup o L„„(0, -)\\ H i < limsup // |V ' t , x ,■ p„| dx dt 

n^>oo n->oo J J Q, n 

< limsup // \u n (t,x)\ z \\J t ,xU° n (t,x)\dxdt 

n— >oo J JQ n 



< limsup \\u°J 3 L{ 



,(«„) 



|Vt,»«°llif 



Therefore, to complete the proof of the proposition, we merely need to verify the 
following: For n sufficiently large, 



IVtXlk. 



txR 2 ) ^M(4>) 1 



and 



lim ||<|| if M = 0. 

n— too r ' xv ' 



(6.25) 
(6.26) 



.O.lii 



We begin with (6.25). By the triangle inequality, 

||Vt>°||i^ < ||V t ^u°|| L 4 x + ||V t ,X' 

Furthermore, by the Strichartz inequality, the linear term satisfies the bound 



|V t X< lin || L 4 x < \\v° n (0)\\ H s /2 = \\D Xn P<xeJ\\ H V2 <M 
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To bound the contribution coming from u° , we use persistence of regularity (3.3) 



M{4>) 



1 to see that 



and the fact that 5m 

l|Vt,x<| 

This completes the proof of ( |3.25| ); we turn now to ( |6.26 ). 

By the approximation ( 6.22| ) and the triangle inequality, it suffices to show that 



<M W \\(V) 3/2 D Xn P< x oJ\\ L 2 < MW 1. 



lim 

n— too 



t — in X — X 71 



dx dt = 



3.27) 



for every tp e C° 



To do this, we consider the support of the integrand. As x n = 0, 



t — t n X — Xj, 



y-^s-, -^-j e supp-0 => |x| <</, A„, while (t,x)efl n = 
Therefore, the support of the integrand has measure <^ and so 
LHS(@) U A- 4 A?JH| L ~ U K 1 -> 0. 



i < H 



This completes the proof of ( p. 26] ), and so the proof of Proposition 6.11 



□ 



We now return to the proof of Theorem 6.2 in the general case. Combining 
Proposition 6.11 with Proposition 3.4, we deduce that for n sufficiently large there 



exists a global solution v n to (2.2) with v n (0) = 4> n and 5r(w„) <mU) 1- Moreover 



Km ||Re{« n — u n }|L 4 = 0. 



Combining this estimate with Rew,° = Rew,^ o L u ^ and ( |6.22 ) yields ( |6.2p , thus 



completing the proof of Theorem 6.2 



□ 



7. Minimal-energy blowup solutions 



The goal of this section is to prove Theorem 1.9, which asserts that failure of 



of our main result, Theorem 1.4, would imply the existence and almost periodicity 



(modulo translations) of minimal-energy counterexamples. 



As described in the introduction, if Conjecture 1.2 were to fail, then there would 



exist a critical energy E c > (also E c < E(Q) in the focusing case), defined to 
be the unique positive number pos sessing the following properties: First, if u is a 
real-valued, global solution to (Q with E(u) < E c (and M(u(0)) < M(Q) in the 
focusing case), then 5r(u) <_b( u ) 1; second, there exists a sequence u n of global 
solutions to (|l.l| ) such that E(u n ) < E c (and M(u n (0)) < M(Q) in the focusing 
case), limn^oo E(u n ) = E c , and lim^oo 5r(m„) = oo. 

We pause now for two remarks on the preceding discussion. First, the fact that 
E c > is a consequence of the sm all-da ta theory presented in Proposition In 
the focusing case, we also invoke (2.34) to see that for M(u(0)) < M(Q), small 
energy implies small H\ norm. 

Second, the solutions u n used in the definition of E c are stated to be global (in 
time). This involves no loss of generality as can be seen in at least two ways: Either 
(a) we choose E{u n ) to converge to E c from below; thus, not only is u n global but, 
by the definition of E Cl even admits global spacetime bounds. Or (b) we invoke 
Corollary |3.3|. 



The main step in proving Theorem 1.9 is the following proposition. 
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Proposition 7. 1 (P alais-Smale condition mo dulo translations). Fix /j, = ±1, sup- 
pose Conjecture \l.<\ holds but Conjecture l.i fails for this value of /j,, and let E c 
denote the critical energy. Accordingly, let u n be a sequence of global solutions to 
( |l.l| ) such that the following hold 

E(un) < E c and E c = lim E(u n ), (7.1) 

n— »oo 

M(u n (0)) < M(Q) in the focusing case, and (7-2) 
lim S<o(u n ) = lim S>o(u n ) = oo. (7.3) 

Then after passing to a subsequence, (u„(0), dtu n (0)) converges in x L? x , modulo 
translations. 



Proof. We will continue to work with the first-order version of our equation, (2.2). 
Correspondingly, let 

V n ■= U n + i(V}~ 1 d t U n - 

Thus, our goal is to prove that, after passing to a subsequence, v n (0) converges 
modulo translations in H*. Using Proposition 2.12 in conjunction with ( [7.2| ), we 
observe that in the focusing case, v n satisfies 

IMOjUlg =: M(u„(0)) < 2E C < M(Q), (7.4) 

and that in both the focusing and defocusing cases, we have 

K(0)||ffi <E{v n ) <E C . (7.5) 



As it is bounded in we may apply Theorem 5.1 to the sequence v n (0) to 
obtain a linear profile decomposition 

j 

«n(0) = + W t ±<J<Jo, (7-6) 

3=1 

where 

^=T xi ^>h vi D xi Pl4P. (7.7) 



Note that Jq > 1, for otherwise, (7.3) would be inconsistent with (p.2j). Passing to 
a further subsequence, we may assume that M(<jP n ) and E((p 3 n ) converge for each 
1 < j < Jo- 

From Proposition |5.3| , we also know that the energy decouples: 
J 

lim V E{<fi) + E{wi) = lim E{v n ) = E c , (7.8) 

3=1 

for each 1 < J < Jq. 

Lemma 7.2. After passing to a subsequence, one of the following scenarios occurs: 
Case I. There is only a single profile and it satisfies 

lim E{4i) = E c . (7.9) 

Case II. There exists 8 > such that for every 1 < j < Jq, 

lim EUl) <E C ~S. (7.10) 

n— >oo 

Irrespective of the above, in the focusing case we also have that for each j and J , 
M(4? n ) < M(Q) and M{wi) < M{Q) (7.11) 
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when n is sufficiently large (possibly depending on j, J). 



Proof. W e be gin with ( 7.11 ). By (5.4) and a simple inductive argument (as in the 
proof of (5.3)), we see that the mass decouples, that is, 

J 

lim (M(u n (0)) - V MicjA.) - M{w J n )\ = for each 1 < J < J . 



As masse s are non-nega tive, this and ( 7A ) imply the validit y of ( 7.11 ) . 

With (7.11) proved, (2.34) implies that the summands in (7.8) are positive for n 
sufficiently large (depending on J) in the focusing case; in the defocusing case this 
is manifestly true. This positivity allows us to deduce that if (7.9) fails then (7.10) 
must hold. For j = 1, this is obvious. For j ' > 2 we may set S — lin^^oo Ejffij, 
which is positive since (j) 1 ^ 0, by construction. □ 



Now we move to the consideration of the two scenarios described in Lemma 7.2 



Case I: Assume that ( |7.9| ) occurs. Then by (|7.q), together with ( 2.34 ) invoked 
/ 



for , we have that 



bl = wl -» in Hi. 



(7.12) 



Our analysis now breaks into three sub-cases. 



Case IA: Suppose that A* — > oo. We will apply Theorem 3.2, but in the focusing 
case we must first verify the following: 



Lemma 7.3. Assume that we are in the focusing case. Let 1 < j < Jq and assume 
oo . Then 

M{<jy>) < M(Q). 



that lim„_ ! . 00 \ 3 n 



Proof. By fl7.8| ) and Lemma 2.4, we have 

2E C > lim 2E(<&)= lim \\Tj e« v >L,, D^P^f^ 

Since 2E C < 2E(Q) = M(Q), the lemma is proved. 



□ 



Thus by Theorem |6.2| , for n sufficiently large, the solution to (2.2) with initial 
data v^(0) — <f> x n is global and satisfies Sr(^) <b c 1. We may now use ( 7.12| ) and 
Proposition 3.4 to conclude that iSr(u„) < oo, a contradiction. 

We must therefore have that A* = 1. This implies that (j) 1 € H^., 



and either iz — > ±oo or tl, 



Case IB: Suppose that A* = 1 and t\ — > ±oo. We treat the case — > — oo, the 



other case being similar. By the Strichartz inequality, e 
and so 



-i*<V>J,l 



2 )- 



,-i(t-tl)(V) -Ti 



L* x ([0,od)xR 2 ) 



as n — > oo. 



Hence by Proposition 3.1, if v\ is the solution to (2.2) with initial data v^{0) — 
4>n, then fo r n sufficiently large, S>o(v^) < oo. As in Case IA, we can now use 
Proposition 3.4 to conclude that for n large, S>o(v n ) < oo, a contradiction. 



52 



ROWAN KILLIP, BETSY STOVALL, AND MONICA VISAN 



Case IC: If A* = 1 and t x n = 0, we have reduced the linear profile decomposition 
©to 

T- x \ v n (0) =<p 1 + T_ x xw\. 



Combining this with (7.12), we have proved the proposition when (7.9) holds 



Case II: Wc will show that this is inconsistent with ( |7.3| ) by using (7.6) to 
produce a nonlinear profile decomposition of the v n and then applying the stability 
theory. We begin by introducing nonlinear profiles v 3 n ; their definition depends on 
the behavior of X 3 n . 

First assume that j is such that A^ = 1. Then <\P £ H\ and 



T 



If, in addition, t 3 n = 0, then we let v 3 be the maximal-lifespan solution to ( |2.2|) with 
vi(Q) = (j) 3 . If t 3 n — > — oo (respectively t 3 n —> oo), then we let v 3 be the maximal- 
lifespan solution to (2.2) which scatters forward (respectively backward) in time to 

Lemma 7.4. In Case II, if X J n = 1 for some j, then v 3 defined as above is global. 



Proof. This follows from Corollary 3.3. In the focusing case however, we must first 
establish 

M(Recj) j ) < M(Q) and E((j) 3 ) < E(Q). 



Since Af(0 J ) = M(<j^ l ), the first inequality is immediate from ( |7.1l|) . We turn to 
the energy bound. If t 3 n = 0, then E( cf) 3 ) = E(<f) 3 l ), and we are done. If t 3 n — >• ±oo, 
then by using the dispersive estimate ( pj.4[ ) and approximating cf> 3 in H\ by Schwartz 
functions, we see that 



lim E(<jP n 

n— )-oc 



lim 1 1 

n— >oo 



v n \\li = WfHi>E{4j), 



and hence by Lemma 7.2, E^cjp) < E c < E{Q) in this case as well. 
Thus if X J n = 1 , we may define nonlinear profiles by 



□ 



v 3 n (t,x) :=v 3 (t-t 3 n ,x-x 3 n ). 



Next, suppose that lim 



oo. Then by Theorem p.2| (and Lemma 7.3 



the focusing case), for n sufficiently large we may define v 3 n to be the solution to 
@) with initial data v 3 n (0) = (f> 3 n . 

Lemma 7.5. In Case II, for each j (regardless of the behavior of the X 3 n ) we have 



lim E(v 3 n ) = lim 

n—>oo n—t-oo 

lim M(vi(0)) < M(Q) in the focusing case, and 

U—>oo 

lim S R (<) < lim E{v 3 n ) 2 . 

n— ¥oo n—±oo 

Furthermore, for each j and e > 0, there exists ip — ip E £ C^°(R 
such that if i\> 3 n is defined as in ( |5.16 ) and n > Nj lS , then we have 



II Re(^ - v 3 n )\\ Li j RxR 2 } < e. 



(7.13) 
(7.14) 
(7.15) 

I 2 ) and N j>e 
(7.16) 
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Proof. Equality ( 7.13 ) is a tautology if X J n — > oo or X 3 n = 1 and P n = 0, since in 
these cases v 3 n (Q) — (j> ] n . If = 1 and t 3 n — > ±oo, then by the definition of v 3 n and 
(3.2), we have 

E(vi)=E(vi) = ill^ll^ = lim 
where for the last equality, we have used the dispersive estimate as in the proof of 



Lemma 7.4 



Inequality (7.14) follows easily from (7.11) and the definition of v J n . 



When lim.- 



E((p J n ) is below the small data threshold, (7.15) follows from 

and (2.34) imply that the 



Proposition 3.1. Note that in the focusing case, (7.11 



energy controls the H\ norm. On the other hand, by (7.S) the limiting energy can 
only exceed this threshold for finitely many values of j. For these cases, we invoke 
(7.1C) and the definition of E c . As we are invoking the contradiction hypothesis 
here, there is no hope of being explicit about the constant in (7.15) other than that 
it is independent of j. 

As for (7.16), in the case X° n = 1, this follows from the fact that v 3 n is just a 
translate of v 3 £ Lj r (R x I 2 ). In the case X 3 n — » oo, this approximation follows 
from Theorem 6.2. □ 



For 1 < J < J , we let 



V n J (t) :=£i£(t)+e-*< v >titf, 
i=i 

which is defined globally for n sufficiently large (depending on J). Our immediate 
goal is to use Proposition 3.4 to show that V^(t) is a good approximation to v n (t) 
when n and J are sufficiently large. 



Lemma 7.6. We have the following spacetime bounds on 

\\ V n\\ L °°H 112 } < 



lim sup lim sup {|| Rel^ 7 || i 4 

J— >oo n— >oo 



(7.17) 



The Vn are approximate solutions to (2.2) in the sense that 



(-*$ + <v>)tf + /x<v)- 1 (Re#) 3 



Ei 



whe 



lim limsup||(V)£ , ^|| L 4/ 3 =0. 

Furthermore, for each J we have 

hm || v n (0) - V^(0)\\ H i m = 0. 



(7.18) 
(7.19) 



Proof. We begin with (7.19). By the triangle inequality and the definitions, 
lim K(0) - y/(0)||^ (K2) < lim £ K(0) - #|| h;(b 



o. 



i=i 



To see that the limit vanishes, we note that each of the summands is identically 
zero, except in the case when X J n = 1 and \P n \ —> oo. However, even in this case, 
the difference tends to zero in H^.(R 2 ) by construction. 
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As a preliminary to the main part of the proof, we note that combining (7.15) 
and (O) yields 



lim sup lim sup >^ || Reu^|j? 4 < lim lim E(v 3 n ) < E c 

3 = 1 3=1 



(7.20) 



We now bound the Lf x term in (7.17). By (7.16) and Proposition 5.5, the 
nonlinear profiles decouple in the sense that whenever j ^ j', we have 



lim || Rev J n Rev J n \\ L * (lxl 2) = 0. 
Combining this with (|5.2| ) and then using fl7.2CD shows 



(7.21) 



lim sup lim sup II Re Vlf II f 4 = limsuplimsup 7 || Re-i£||j4 ^E 2 C . (7. 

7 , „ . t , x T , „ , f ^ t , a; 



22) 



J— >oo ?i— >oo 



J— >oo n— >oo 



3=1 



Next, we prove (7.18). A simple computation shows that 

^(0=M(V)- 1 {Re^<(0+Ree^ v >u; n / } 3 - / i(V)- 1 ^(Re<) 3 , 
3=1 ^ 3=1 



and so, by the triangle inequality it suffices to show 

3 / ./ 



lim lim sup 

J— >oo n— >oc 



J 

E Rev » 

3=1 



and lim lim sup 

,7 >oo n — 



We observe that 
J 



3=1 ' 3=1 



(7.23) 



0. (7.24) 



^Re^+e-^l^) -(E Re ^ 

3=1 ' V?'=i 



< |^*(v>, ( /| d , | fi -«(v>„,J| 



E Reu ' 

3=1 



and so (|7.23|) follows from Holder's inequality, (|5.2|) , ([7.201) , and (|7.2l[) . As 



E Re '< 



3=1 



3 j 



E( Re <) ~ E |Re<Re<Re<|, 

3=1 i<ji,h,h<J 
31 #33 



we can use Holder's inequality together with ( 7.2l| ) and ( 7.15| ) to see that (7.24) is 
true, even without sending J — » 00. 

■ — — I 1/2 

Finally, we complete the proof of (7.17) by bounding t he Lf H x ' norm. By the 

Strichartz inequality, ( [7.19| ), and then ([7.5|), ( |7.22| ), and ([7.18Q , 

lim sup lim sup ||V^ || iooJ yi/2 

J->oo n->oo ' x 



< hmsuplimsup{||«„(0)||^ + || ReV n J ||* f + \\{\/}E^\\ i/3 \ 
This completes the proof of ( |7.17 ) and so also the lemma. 



< 00. 



□ 
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By Lemma 7.6, we may apply Proposition 3.4 to conclude that in Case II, v n is 
denned globally and STst(v n ) <£ c 1 for n sufficiently large. This contradicts (7.3) 
and so Case II cannot occur. Tracing back, we see that the only possibility is 
Case IC, and so Proposition [Fj] is proved. □ 

Now we prove the existence of a minimal-energy, almost periodic blowup solution 
to (O). 



Proof of Theorem [l.Q . By the definition of the critical energy and Corollary 3.3 
there exists a sequence u n : R x R 2 — > R of global solutions to (IT) with E(u n ) < 
E c (and M(u n (0)) < M(Q) in the focusing case), Wmn^aa E(u n ) = E c , and 
linin^oo Sk(un) = oo. For each n, we choose t n so that S<t n ( u n) — S>t n (u n ). 
By time-translation invariance, we may assume that t n = 0. We thus have 



lim S< (u n ) = lim S> (u n ) 



oo. 



By Proposition 7.1 , after passing to a subsequence, there exist a sequence {x n } C 
i 2 and a pair of functions (uq, u\) so that 



(T Xn u n (0),T Xn d t u n (0)) -> (uo,Ui), strongly in fT* x L 2 



(7.25) 



The limit then satisfies E(up, Ui) = E c (and M(uq) < 2E C < M(Q) in the focusing 



to fll.lD with 



case). By Corollary 3.3, there exists a global solution u : R X R - 
initial data u(0) = uq and dtu(0) — u\, satisfying 

||«IUffl-i ^ S(tt ,Ul). 

We will show that this solution u satisfies the conclusions of the theorem; it 
remains to be seen that u blows up forward and backward in time and is almost 



(7.26) 



periodic modulo translations. If S>o(u) < oo, then by (7.25) and (7.26), we may 
apply Proposition 3.4 to conclude that 



lim S >0 {u n ) < oo, 

n— >oo 

a contradiction. Therefore u must blow up forward in time, and by a similar 
argument, u must blow up backward in time as well. 

Finally, for almost periodicity modulo translations, we observe that if {t' n } C R 
is any sequence, we have 

S>o(u{- + O) = S< (u(- + t' n )) = oo 



and so by Proposition 7.1, a subsequence of (u(t' n ) , dtu(t' n )) converges in H\ x L 2 
modulo translations. Thus, the orbit {(u(t),dtu(t)) : t £ R} is precompact modulo 
translations. By the Arzela-Ascoli Theorem, this is equivalent to u being almost 
periodic modulo translations in the sense of Definition 1.7. This completes the 
proof of the theorem. □ 



8. Death of a soliton 
In this section, we will preclude the soliton-like solution, thus concluding the 



proof of Theorem 1.4. More precisely, we will prove 



Theorem 8.1 (No soliton). There are no minimal- energy blowup solutions to (1.1) 
that are soliton-like in the sense of Theorem l.i. 
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To prove this theorem, we will argue by contradiction. Let u : M x M 2 — > R 
be a soliton-like solution, that is, a minimal-energy blowup solution that is almost 
periodic modulo translations (and satisfies M(u(0)) < M(Q) in the focusing case). 
Then, invoking (2.34) in the focusing case, 

Nl If Hi + htWlrLi <4E(U). (8.1) 



By Corollary 33, Remark 3.10 , and the minimality of u as a blowup solution, we 
must have that the momentum of u is zero: 

P(u) = 0. (8.2) 



Our next step will be to use (5.2) to control the motion of x(t), which we do in 
the manner of jlj|, ||. 

Lemma 8.2 (Controlling x(t)). The spatial center function of u satisfies \x{t)\ = 
o(t) as \t\ — > oo. 

Proof. By spatial-translation invariance, we may assume that x(0) = 0. We argue 
by contradiction. If the conclusion of the lemma did not hold, then there would 
exist S > and a sequence t n — ¥ ±oo such that 

|;r(*„)| > <5|* n |. 

Without loss of generality, we may assume that t n — > oo and that 
\x(t)\ < \x(t n )\ for all < t < t n . 



Now let r) > be a small constant to be chosen later. By Remark 1.8, there 
exists C{rf) > such that 



sup/ \u(t,x)Y + \Vu(t,x)\ z + \u t (t,x)Y + \u{t,x)\^dx < rj. (8.3) 

We define 

R n :=C( V ) + \x(t n )\. 

Finally, let be a smooth function with <fi(r) = 1 for r < 1 and 4>(r) = for r > 2 
and define an approximation to x(t) by 



x Rn{t) : = / x4>(^)e u (t,x)dx, 

JWL 2 



where e u denotes the energy density of u: 

e u : =I| M |2 + i| Vw | 2 + lH 2 + fM 4 . 
For each n, by the triangle inequality, (|]l]), and ( jslj| ) 

\X Rn (0)\< I \x\\e u (0,x)\dx+ I \x\\e u {0,x)\dx 

J\x\<C(ri) JC(?))<|a;|<2_R Il 

<C(77)^( M )+ryi? n . 



On the other hand, by the triangle inequality followed by (^3) we also have 
\X Rn {t n )\>\x{t n )\E{u)- ( \x-x(t n )\<f>($-)\e u (t n )\dx 



\x-x{t n )\<C( v ) R "' 

\x - x(t n )\4>(^-)\e u (t n )\ dx 



Rn 

x-x(t n )\>C(v) 
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- \x(t n )\ [ [1-0(M)] 1^(^)1 da; 
Jr 2 

> \x(t n )\ [E(u) - 47y] - C(r)) [2E(u) + 2 V ] . 
Thus, taking ry sufficiently small compared to E{u) we get 

\X Rn (t n ) - X Rn (0)\ > E(u) \x(t n )\ - C( V ). (8.4) 
To derive a contradiction, we now seek an upper bound on the left-hand side of 



(p.4|). A computation using (|8.2| ) shows that 

d t X R Jt) = [ \l~^M.)] Ut Vudx- l_ *cl>\\2L)u t x-Vudx. 



Thus by 

\d t X Rn (t)\<r). (8.5) 



Combining (8.4) with ( |8.5| ) and the fundamental theorem of calculus, we obtain 

r)t n > \X Rn (tn) - X Rn (0)\ > E{U) \x(t n )\ - C{rj) > E(u) St n - C(rj). 

Choosing 7/ sufficiently small (depending on S and E(u)) and then choosing n suf- 
ficiently large, we derive a contradiction. □ 



We are now in a position to complete the proof of Theorem 8.1. We will use a 
virial-type argument. 



Let rji > and 772 > be small constants to be chosen later. By Lemma 3.2 
there exists T — To(?7i) > such that 

\x(t)\<riit for all t > T . (8.6) 



By Remark 1.8, there exist C(rji) > and C{rj2) > such that 



sup/ \u(t,x)\* + \Vu(t,x)\ A + \u t (t,x)\ z + \u(t,x)^dx < 771 (8.7) 

t£K ^|a:-x(t)|>C(?7i) 



and 



Using Plancherel and (|8.8|), we find 



sup/ \u(t,Wd£< m . (8.8) 

tSK J\i\<l/C{ m ) 



\u(t,x)\ 2 dx = Ht,0\ 2 d^+ \u(t,0\ 2 dti 

J\ti\<l/C{V2) J\S\>l/C(ri 2 ) 

<m + c{m) 2 I \Vu(t, x )\ 2 dx. (8.9) 



With <j) as in the proof of Lemma |8.2| and 0<£<l<i?tobe specified later, 
we define 



Zn(t) = — / 4>{^;) u t(t, x)x ■ Vu(t, x) dx — (1 — e) / u t (t, x)u(t, x) dx. 
Jr 2 Jr 2 

Note that by Cauchy-Schwarz and ( |8.l| ), 

\Z R {t)\ < RE{u) < u R. (8.10) 
On the other hand, a computation establishes 

d t Z R (t)=e[\\u(t)f Hl + \\u t \\l 2 ] + (l-2s) f \Vu(t)\ 2 + l§\u(t)\ 4 dx 
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2e 



u(t)\ 2 dx~ 1 - [ M ;)|2 _ | tt(t )|2 _ E\ u{t )f] dx 



+ / l 4^'C4) [Mt)\ 2 |V U (t)| 2 - \u(t)\ 2 - f | U (t)| 4 ] da; 



K 2 



^'(\A) [x . Vu{t)?dx . 



Invoking the sharp Gagliardo-Nirenberg inequality, (8.7), and (S.9), we find 
\d t Z R (t)\ > e[\\u(t)f H1 + K||| 2 ] - 2er, 2 - W Vl 



{(l-Jte) 



, M(«(t)) 
1 + ^ M(Q) 



2£C(7 72 ) i 



|Vu(/j)| 2 da; 



for all Tq <t <Ti and i? = C , (77i) + sup tg [ T . o Tl ] Choosing 772 small depending 

on u, then e sufficiently small depending on C(rj2) (and recalling that in the focusing 
case we have M(u(t)) < M(Q)), and finally r\\ small enough depending on e and 
u, we derive 

\d t Z R (t)\> u l for T < t < T x and R = C(rji) + sup (8.11) 

te[T ,Ti] 

) , and then 



.11 



Combining the fundamental theorem of calculus with (8.1C) and 
invoking ([B.6|), we find 

Ti-To^C^O+rftTi for all T x > T . 

Choosing 771 small depending on u and Ti sufficiently large, we derive a contradic- 
tion. 

□ 



This completes the proof of Theorem 3.1 



9. Finite time blowup 

In this section we employ the method of Payne and Sattinger 
Theorem 1.6, whose statement we now repeat: 



to prove 



Theorem 9.1 (Blowup). Let u be a maximal-lifespan solution to (1.1) in the fo- 
cusing case with initial data obeying 

E(u) < E{Q) and M(u(0)) > M{Q). 

Then the solution u blows up in finite time in at least one time direction. 



Proof. Let M(t) := M(u(t)) = J \u{t, x)\ 2 dx. By part (ii) of Proposition |T2 
know that M(t) > M(Q) (and so non-vanishing) and also that 



M"(t) > 6 / \u t (t,x)\ 2 dx. 
Jr 2 

Combining this with the Cauchy-Schwarz inequality we obtain 

[M'(t)] 2 <4^J \u(t,x)\ 2 dxj^J \u t (t,x)\ 2 dxj <lM{t)M"{t) 



and hence, 



dtt M(t) 



_ 1/2 _ 2M"(t)M(t) - 3M'(t) 2 



< 0. 



AM{tfl 2 

This says that M(t)^ 1 / 2 is strictly concave, which is inconsistent with M(t) -1 / 2 
being a positive function defined on the whole real line. In particular, if Af'(0) > 



THE CUBIC KLEIN-GORDON EQUATION IN TWO SPACE DIMENSIONS 



-,<) 



then the solution must blow up in finite time in the future, while if M'(0) < 0, it 
must blow up in finite negative time. □ 



References 

T. Akahori and H. Nawa, Blowup and scattering problems for the nonlinear Schrddinger 
equations. Preprint arXiv : 1006 . 1485. 

H. Bahouri and P. Gerard, High frequency avvroxima tion of soluti ons to critical nonlinear 



MR1705001 



wave equations. Amer. J. Math. 121 (1999), 131-175. 
P. Begout and A. Vargas, Mass concentration phenomena for the L 2 - critical no nlinear 
Schrddinger equation. Trans. Amer. Math. Soc. 359 (2007), 5257-5282. V1R2327030 
J. Bourgain, Global wellposedness of defocusing critical nonlinear Sc hrddinger equation in 



MR1626257 



the radial case. J. Amer. Math. Soc. 12 (1999), 145-171. 
J. Bourgain, Global solutions of nonlinear Schrddinger equations. American Mathematical 
Society Collo quium Publications, 46. American Mathematical Society, Providence, RI, 1999. 



MR169157 



H. Brezis and E. Lieb, A relation between pointwise converge nce of functio ns and convergence 
of functional. Proc. Amer. Math. Soc. 88 (1983), 486-490. MR0699419| 
T. Cazenave, Semilinear Schrddinger e quations. Cou rant Lecture Notes in Mathematics, 10. 



MR2002047 



American Mathematical Society, 2003. 
M. Christ, J. Colliander, and T. Tao, Asymptotics, frequency modulation, and low regularity 
ill-vosedness for canonical defocusing equations. Amer. J. Math. 125 (2003), 1235-1293. 



MR2018661 



J. Colliander, M. Keel, G. Staffilani, H. Takaoka, and T. Tao, Global well-posedness and 
scattering for th e energu- crit ical nonlinear Schrddinger equation in R 3 . Ann. Math. 167 
(2008), 767-865. |MR2415387| 

P. Constantin and J.-C. Saut, Local smooth ing properties of dispersive equations. J. Amer. 



Math. Soc. 1 (1988), 413-439. MR.0928265 



B. Dodson, Global well-posedness and scattering for the defocusing, L 2 -critical, nonlinear 
Schrddinger equation when d>3. Preprint arXiv : 0912 . 2467 

B. Dodson, Global well-posedness and scattering for the defocusing, L 2 -critical, nonlinear 
Schrddinger equation when d = 2. Preprint arXiv : 1006 . 1375 

T. Duyckaerts, J. Holmer, and S. Roudenko, Scattering for the non-radial 3D cubic nonlinear 



MR2470397 



Schrddinger equation. Math. Res. Lett. 15 (2008), 1233-1250. 
J. Ginibre and G. Velo, Time decay of finite energy solutions of the nonlinear Klein-Gordon 
and Schrddinger equations. Ann. Inst. H. Poincare Phys. Theor. 43 (1985), 399-442. 
S. Ibrahim, N. Masmoudi, M. Majdoub, and K. Nakanishi, Scatterin g for the two- dimensional 



MR2569615 



energy- critical wave equation. Duke Math. J. 150 (2009), 287-329. 
S. Ibrahim, N. Masmoudi, and K. Nakanishi, Scattering threshold for the focusing nonlinear 
Klein-Gordon equation. Preprint arXiv : 1001 . 1474. 

M. Keel a.nrl T. Tao, Endpoint Strichartz estimates. Amer. J. Math. 120 (1998), 955-980. 



MR1646046 



C. E. Kenig and F. Merle, Global well-posedness, scattering and blow up for the energy- 
critical, focusing , nonlinear Schrddinger equation in the radial case. Invent. Math. 166 
(2006), 645-675. |MR2257393 

C. E. Kenig and F. Merle, Global well-posedness, scattering and blow-u p for the ener gy- critical 



MR2461508 



focusing non-linear wave equation. Acta Math. 201 (2008), 147-212. 

S. Keraani, On the blow-up vheno menon of the critical nonlinear Schrddinger equation. J. 
Funct. Anal. 235 (2006), 171-192. 



MR2216444 



R.. Killip, S. Kwon, S. Shao, and M. Visan, On the mass-critical generalized KdV equation. 
Preprint arXiv: 0907. 5412. 

R. Killip, T. Tao, and M. Visan, The cubic nonlinear Schrddinger eq uation in two dimensions 



MR2557134 



with radial data. J. Eur. Math. Soc. (JEMS) 11 (2009), 1203-1258. 
R. Killip and M. Visan, Nonlinear Schrddinger equations at critical regularity. To appear 
in proceedings of the Clay summer school "Evolution Equations" , June 23-July 18, 2008, 
Eidgenossische Technische Hochschule, Zurich. 

R. Killip and M. Visan, The focusing energy -critical nonlinear Schrddinger equation in di- 
mensions five and higher. Amer. J. Math. 132 (2010), 361-424. MR265477s| 



GO 



ROWAN KILLIP, BETSY STOVALL, AND MONICA VISAN 



MR247289C 



R. Killip, M. Visan, and X. Zhang, The mass- critical nonlinear Schrddin qer equation with 
radial data in dimensions three and higher. Anal. PDE 1 (2008), 229-266. 
M. K. Kwong, Uniqueness of vosi tive solutions of Au — u + u v = in M 
Mech. Anal. 105 (1989), 243-266. [MR.0969899 



Arch. Rational 



E. H. Lieb and M. Loss, Analysis. Second edition. Gr aduate Stud ies in Mathematics, 14 



MR1817225 



American Mathematical Society, Providence, RI, 2001 
P.-L. Lions, The concentration- compactness principle in the calculus of variation s. The localh ' 
compact case. I. Ann. Inst. H. Poincare Anal. Non Lineaire 1 (1984), 109—145. 



MR0778970 



A. Moyua, A. Vargas, and L. Vega, Restriction theorems and maximal ope rators related to 



MR1671214 



oscillatory integrals in M 3 . Duke Math. J. 96 (1999), 547-574. 
K. Nakanishi, Scattering theory for nonlinear Klein -Gordon equa tion with Sobolev critical 
power. Internat. Math. Res. Notices 1 (1999), 31-60. MR1666973 

K. Nakanishi, Energy scattering for nonlinear Klein-Gordon an d. Schrddinger equations in 

spatial dimensions 1 and 2. J. Funct. Anal. 169 (1999), 201-225. |MR1726753| 

K. Nakanishi, Remarks on the energy scattering f or nonlinear K le.in-Gord.on. and Schrddinger 



MR1829982 



equations. Tohoku Math. J. 53 (2001), 285-303. 
K. Nakanishi, Transfer of global wellposedness from nonlinear Klein- Gordon equat ion to 
nonlinear Schrddinger equation. Hokkaido Math. J. 37 (2008), 749-771. 



MR2474174 



L. E. Payne and D. H. Sattinger, Saddle points and ins tability of nonlinear hyperbolic equa 
Hons. Israel J. Math. 22 (1975), 273-303. 



MR0402291 



E. Ryckman and M. Visan, Global well-posedness and scattering for the defocusing energy- 
critical non linear Schrddinger equation in R 1 " 1 " 4 . Amer. J. Math. 129 (2007), 1—60. 



MR2288737 



P. Siolin, R.eaularitu of solutions to the Schrddinger equation. Duke Math. J. 55 (1987), 



MR090494S 



699-715. 

E. M. Stein, Singular integrals and differentiability properties of func tions. Princet on Math- 



MR0290095 



ematical Series, 30. Princeton University Press, Princeton, NJ, 1970 
E. M. Stein, Some problems in harmonic analysis. In "Harmonic analysis in Euclidean spaces 
(Proc. Sympos. Pure Math., Williams Coll., Williamstown, Mass., 1978), P art 1." Proc 



MR0545235 



Sympos. Pure Math., XXXV, Part 1, Amer. Math. Soc, Providence, R.I., 1979. 
E. M. Stein, Harmonic analysis: real-variable methods, orthogonality, and oscillatory inte- 
grals. Prince ton Mathematical Series, 43. Princeton University Press, Princeton, NJ, 1993 
MR1232192] 



T. Tao, A s harv bilinear restrictions estimate for paraboloids. Geom. Funct. Anal. 13 (2003), 
1359-1384. 



MR2033842 



T. Tao, Two remarks on the genera lised Korteweg-de Vries equation. Discrete Contin. Dyn. 
Syst. 18 (2007), 1-14. |MR2276483 

T. Tao and M. Visan, Stability of energy-critical non linear Schrd dinger equations in high 
dimensions. Electron. J. Diff. Eqns. 118 (2005), 1-28. |MR217455o| 

T. Tao, M. Visan, and X. Zhang, The nonlinear Schrddinger equation with com bined power- 



type nonlinearities. Comm. Partial Differential Equations 32 (2007), 1281-1343. MR2354495 
T. Tao, M. Visan, and X. Zhang, Minim,al-m,a,s s blowup solutions of the mass-critical NLS. 
Forum Math. 20 (2008), 881-919. 



MR2445122 



T. Tao, M. Visan, and X. Zhang, Global well-posedness and scattering for the mass-critical 
nonlinea r Schrddinger equation for radial data in high dimensions. Duke Math. J. 140 (2007), 



MR235507C 



165-202. 

M. Visan, The defocusing energy-critical n onlinear Sch rddinger equation in higher dimen- 
sions. Duke Math. J. 138 (2007), 281-374. MR2318286] 



L. Vega, Schrddinger equa tions: •oointw ise convergence to the initial data. Proc. Amer. Math. 
Soc. 102 (1988), 874-878. 



MR093485E 



M. Weinstein, Nonlinear Schrddi nger equatio ns and sharp interpolation estimates. Comm. 
Math. Phys. 87 (1983), 567-576. |mR0691044 



University of California, Los Angeles 
University of California, Los Angeles 



University of California, Los Angeles 



